Gene EcolC_3322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3322 
Symbol 
ID6067220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3640896 
End bp3645152 
Gene Length4257 bp 
Protein Length1418 aa 
Translation table11 
GC content52% 
IMG OID641602737 
ProductIg domain-containing protein 
Protein accessionYP_001726270 
Protein GI170021316 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACATT ATAAAACAGG TCATAAACAA CCACGATTTC GTTATTCAGT TCTGGCCCGC 
TGCGTGGCGT GGGCAAATAT CTCTGTTCAG GTTCTTTTTC CACTCGCTGT CACCTTTACC
CCAGTAATGG CGGCACGTGC GCAGCATGCG GTTCAGCCAC GGTTGAGCAT GGGAAATACT
ACGGTAACTG CTGATAATAA CGTGGAGAAA AATGTCGCGT CGTTTGCCGC AAATGCCGGG
ACATTTTTAA GCAGTCAGCC AGATAGCGAT GCGACACGTA ACTTTATTAC CGGAATGGCC
ACAGCTAAAG CTAACCAGGA AATACAGGAG TGGCTCGGGA AATATGGTAC TGCGCGCGTC
AAACTGAATG TCGATAAAGA TTTCTCGCTG AAGGATTCTT CGCTGGAAAT GCTTTATCCG
ATTTATGATA CGCCAACAAA TATGTTGTTC ACTCAGGGAG CAATACATCG TACCGACGAT
CGTACTCAGT CAAATATTGG TTTTGGCTGG CGTCATTTTT CAGGAAATGA CTGGATGGCG
GGGGTGAACA CCTTTATCGA CCATGATTTA TCCCGTAGTC ATACCCGCAT TGGTGTTGGT
GCGGAATACT GGCGCGATTA TCTGAAACTG AGCGCCAATG GTTATATCCG GGCTTCTGGC
TGGAAAAAAT CGCCGGATGT TGAGGATTAT CAGGAACGCC CGGCGAATGG CTGGGATATT
CGTGCTGAGG GCTATTTACC CGCCTGGCCG CAGCTTGGCG CAAGCCTGAT GTATGAACAG
TATTATGGCG ATGAAGTCGG GCTGTTTGGT AAAGATAAGC GCCAGAAAGA CCCGCATGCT
ATTTCTGCCG AGGTGACCTA TACGCCAGTG CCTCTTCTGA CACTGAGCGC CGGGCATAAG
CAGGGCAAGA GTGGTGAGAA TGACACTCGC TTTGGCCTGG AAGTTAATTA TCGGATTGGC
GAACCTCTGG CGAAACAACT CGATACAGAC AGCATTCGCG AGCGTCGGGT ACTGGCAGGC
AGCCGCTATG ACCTGGTTGA GCGTAATAAC AACATCGTTC TTGAGTACCG CAAATCTGAA
GTGATCCGTA TTGCTCTGCC TGAACGTATT GAAGGTAAGG GTGGTCAGAC ACTTTCCCTG
GGGCTTGTGG TCAGCAAAGC AACTCACGGA CTGAAAAATG TGCAGTGGGA AGCGCCGTCA
TTACTGGCTG AGGGTGGCAA AATTACCGGT CAGGGTAGTC AGTGGCAAGT AACGCTCCCG
GCTTATCGTC CAGGCAAAGA CAATTATTAT GCGATTTCTG CGGTTGCCTA CGATAACAAA
GGCAATGCCT CAAAACGCGT GCAGACAGAG GTGGTCATTA CCGGAGCAGG TATGAGCGCC
GATCGCACGG CGTTAACGCT TGACGGTCAG AGCCGTATTC AAATGCTTGC TAACGGTAAT
GAGCAAAGAC CGCTGGTGCT GTCTCTGCGC GACGCCGAGG GGCAGCCAGT CACGGGCATG
AAAGATCAGA TCAAGACTGA ACTAGCCTTC AAACCGGCTG GAAATATTGT GACTCGTTCC
CTGAAGGCCA CTAAATCACA GGCAAAGCCA ACACTGGGTG AGTTCACCGA AACTGAAGCA
GGGGTGTATC AGTCTGTCTT TACTACCGGA ACGCAGTCAG GTGAGGCAAC GATTACTGTT
AGCGTTGATG GCATGAGCAA AACCGTCACT GCAGAACTGC GGGCCACGAT GATGGATGTG
GCAAACTCCA CCCTGAGCGC TAACGAGCCG TCAGGTGATG TGGTTGCTGA TGGTCAGCAA
GCCTATACGT TGACGTTGAC TGCGGTGGAC TCCGAGGGTA ATCCGGTGAC GGGAGAAGCC
AGCCGCTTGC GATTTGTTCC GCAAGACACT AATGGTGTAA CCGTTGGTGC CATTTCGGAA
ATAAAACCAG GCGTTTACAG CGCCACGGTT TCTTCGACCC GTGCCGGAAA CGTTGTTGTG
CGTGCTTTCA GCGAGCAGTA TCAGCTGGGC ACATTACAAC AAACGCTGAA GTTTGTTGCC
GGTCCGCTTG ATGCAGCACA TTCGTCCATC ACCCTGAATC CTGATAAACC GGTGGTTGGC
GGTACAGTTA CGGCAATCTG GACGGCAAAA GATGCCTATG ACAACCCTGT GACCAGCCTC
ACGCCGGAAG CGCCGTCATT AGCGGGTGCC GCTGCTGTAG GTTCTACGGC ATCTGGCTGG
ACAAATAATG GTGATGGGAC GTGGACTGCG CAGATTACTC TCGGCTCTAC GGCGGGTGAA
TTAGAAGTTA TGCCGAAGCT AAATGGACAG GATGCGGCAG CAAATGCGGC AAAAGTAACC
GTGGTGGCTG ATGCGTTATC TTCAAACCAG TCGAAAGTCT CTGTCGCAGA AGATCACGTA
AAAGCCGGCG AAAGCACAAC CGTGACGCTT ATTGCAAAAG ATGCACATGG CAACACTATC
AGTGGTCTTT CGTTGTCGGC AAGTTTGACG GGGACCGCCT CTGAAGGGGC GACCGTTTCC
AGTTGGACCG AAAAAGGTGA CGGTTCCTAT GTTGCTACGT TAACTACAGG CGGAAAGACG
GGCGAGCTTC GTGTCATGCC GCTCTTCAAC GGCCAGCCAG CAGCCACCGA AGCCGCGCAG
TTGACGGTCA TCGCCGGAGA GATGTCATCA GCGAACTCTA CGCTTGTTGC GGCCAATAAG
GCTCCGACCG TCAAAATGAC GACGGAACTC ACCTTCACCG TGAAGGATGC GTACGGGAAC
CCGGTCACCG GGCTGAAGCC AGATGCACCA GTGTTTAGCG GTGCCGCCAG CACGGGGAGT
GAGCGTCCTT CAGCAGGAAA CTGGACAGAG AAAGGTAATG GGGTCTACGT GGCGACCTTA
ACGCTGGGAT CTGCCGCGGG TCAGTTGTCT GTGATGCCGC GAGTGAACGG CCAAAATGCC
GTTGCTCAGC CACTGGTGCT GAACGTTGCA GGTGACGCAT CTAAGGCTGA GATTCGTGAT
ATGACAGTGA AGGTTAATAA CCAACTGGCT AATGGACAGT CTGCTAACCA GATAACCCTG
ACCGTCGTGG ACAGCTATGG TAACCCGTTG CAGGGGCAAG AAGTTACGCT GACTTTACCG
CAGGGTGTGA CCAGCAAGAC GGGGAATACA GTAACAACCA ATGCGGCAGG GAAAGTGGAC
ATTGAGCTTA TGTCAACGGT TGCGGGGGAA CACAGCATCA CGGCCTCAGT GAATAATGCT
CAGAAGACGG TTACGGTGAA ATTCAAGGCG GATTTCAGTA CCGGTCAGGC GACCCTGGAG
GTTGATGGCA GCACGCCAAA AGTGGCAAAC GACAATGATG CCTTTACGCT GACGGCAACG
GTTAAGGATC AATACGGCAA CCTTCTGCCT GGCGCTGTGG TCGTCTTTAA TCTGCCTTGG
GGCGTCAAAC CGCTTGCAGA CGGTAATATC ATGGTGAACG CCGACAAGGA GGGTAAAGCG
GAACTGAAAG TGGTCTCCGT GACTGCCGGA ACGTATGAGA TCACGGTGTC GGCAGGAAAT
GACCAGCCTT CGAATGCGCA GTCTGTAACG TTTGTGGCCG ATAAGACTAC GGCGACCATC
TCCAGTATTG AGGTGATTGG CAACCGTGCA GTGGCGGATG GCAAAACCAA ACAGACGTAT
AAAGTTACGG TGACTGATGC CAATAACAAC CTGTTGAAGG ATAGCGACGT GACGCTGACT
GCCAGCTCGG AAAATTTAGT TCTGGATCCT AAAGGGACGG CGAAAACTAA TGAGCAAGGA
CAGGCTGTTT TCACCGGCTC TACCACTATC GCAGCGACAT ATACACTCAC GGCGAAAGTG
GAACAGGCCA ACGGTCAGGT ATCGACGAAA ACTGCTGAAT CTAAATTCGT CGCGGATGAT
AAAAACGCGG TGCTCGCCGC ATCTCCAGAA CGTGTAGATT CTCTGGTGGC GGACGGGAAG
ACTACTGCAA CAATGACGGT TACCCTGATG GCGGGAGTCA ATCCCGTAGG AGGAAGTATG
TGGGTCGACA TTGAGGCTCC GGAAGGAGTG ACGGAGAAGG ATTATCAATT CCTGCCGTCG
AAGGCTGACC ATTTCTCAGG TGGGAAAATC ACGCGTACAT TTAGTACCAG CAAGCCAGGT
GTCTATACGT TCACATTCAA CGCACTGACG TATGGCGGGT ACGAAATGAC GCCTGTGAAG
GTGACAATTA ACGCCGTTGC TGCAGAGACT GAAAATGGCG AGGAGGAGAT GCCATAA
 
Protein sequence
MSHYKTGHKQ PRFRYSVLAR CVAWANISVQ VLFPLAVTFT PVMAARAQHA VQPRLSMGNT 
TVTADNNVEK NVASFAANAG TFLSSQPDSD ATRNFITGMA TAKANQEIQE WLGKYGTARV
KLNVDKDFSL KDSSLEMLYP IYDTPTNMLF TQGAIHRTDD RTQSNIGFGW RHFSGNDWMA
GVNTFIDHDL SRSHTRIGVG AEYWRDYLKL SANGYIRASG WKKSPDVEDY QERPANGWDI
RAEGYLPAWP QLGASLMYEQ YYGDEVGLFG KDKRQKDPHA ISAEVTYTPV PLLTLSAGHK
QGKSGENDTR FGLEVNYRIG EPLAKQLDTD SIRERRVLAG SRYDLVERNN NIVLEYRKSE
VIRIALPERI EGKGGQTLSL GLVVSKATHG LKNVQWEAPS LLAEGGKITG QGSQWQVTLP
AYRPGKDNYY AISAVAYDNK GNASKRVQTE VVITGAGMSA DRTALTLDGQ SRIQMLANGN
EQRPLVLSLR DAEGQPVTGM KDQIKTELAF KPAGNIVTRS LKATKSQAKP TLGEFTETEA
GVYQSVFTTG TQSGEATITV SVDGMSKTVT AELRATMMDV ANSTLSANEP SGDVVADGQQ
AYTLTLTAVD SEGNPVTGEA SRLRFVPQDT NGVTVGAISE IKPGVYSATV SSTRAGNVVV
RAFSEQYQLG TLQQTLKFVA GPLDAAHSSI TLNPDKPVVG GTVTAIWTAK DAYDNPVTSL
TPEAPSLAGA AAVGSTASGW TNNGDGTWTA QITLGSTAGE LEVMPKLNGQ DAAANAAKVT
VVADALSSNQ SKVSVAEDHV KAGESTTVTL IAKDAHGNTI SGLSLSASLT GTASEGATVS
SWTEKGDGSY VATLTTGGKT GELRVMPLFN GQPAATEAAQ LTVIAGEMSS ANSTLVAANK
APTVKMTTEL TFTVKDAYGN PVTGLKPDAP VFSGAASTGS ERPSAGNWTE KGNGVYVATL
TLGSAAGQLS VMPRVNGQNA VAQPLVLNVA GDASKAEIRD MTVKVNNQLA NGQSANQITL
TVVDSYGNPL QGQEVTLTLP QGVTSKTGNT VTTNAAGKVD IELMSTVAGE HSITASVNNA
QKTVTVKFKA DFSTGQATLE VDGSTPKVAN DNDAFTLTAT VKDQYGNLLP GAVVVFNLPW
GVKPLADGNI MVNADKEGKA ELKVVSVTAG TYEITVSAGN DQPSNAQSVT FVADKTTATI
SSIEVIGNRA VADGKTKQTY KVTVTDANNN LLKDSDVTLT ASSENLVLDP KGTAKTNEQG
QAVFTGSTTI AATYTLTAKV EQANGQVSTK TAESKFVADD KNAVLAASPE RVDSLVADGK
TTATMTVTLM AGVNPVGGSM WVDIEAPEGV TEKDYQFLPS KADHFSGGKI TRTFSTSKPG
VYTFTFNALT YGGYEMTPVK VTINAVAAET ENGEEEMP