Gene EcE24377A_3627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3627 
Symbol 
ID5590358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3628522 
End bp3629613 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content48% 
IMG OID640927251 
Productputative fimbrial protein 
Protein accessionYP_001464620 
Protein GI157156670 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3539] P pilus assembly protein, pilin FimA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAG CGCCTCTTAT AACAGGACTT TTGTTGATAT CCACATCCTG CGCTTATGCC 
TCCTCAGGAG GGTGTGGAGC TGACAGCACT AGCGGTGCGA CAAATTACAG CAGTGTGGTT
GATGATGTTA CGGTGAACCA GACAGATAAC GTGACAGGAC GGGAGTTTAC CTCTGCCACG
CTAAGTAGCA CTAACTGGCA ATACGCCTGT ACCTGCTCTG CGGGTAAGGC AGTTAAACTT
GTCTATATGG TCAGCCCTGT ACTTACCACC ACTGGACATC AAGCAGGATA TTACAAACTC
AATGATAGTC TGGATATTAA AACCACATTA CAGGCGAATG ACATTCCAGG ACTCGTGACT
GACCAGACCG TTTCTGTTAA CACCCGATTC ACACAGATAA AAAGCAACAC CGTATATTCT
GCTGCAACCC AAACGGGTGT TTGCCAGGGT GACACGTCTC GTTATGGACC CGTTAATATT
GGTGCAAATA CCACCTTTAC CCTGTATGTC ACCAAGCCAT TTCTCGGCTC GATGACCATT
CCGAAAACGG ATATTGCCGT CATTAAAGGC GCATGGGTCG ATGGAATGGG AAGCCCGTCT
ACAGGTGACT TCCATGATTT AGTCAAGTTA TCGATTCAGG GAAATCTCAC CGCCCCACAG
TCGTGCAAAA TTAATCAGGG CGATGTTATT AAGGTTAATT TTGGATTCAT CAATGGTCAG
AAGTTTACCA CCCGCAATGC CATGCCAGAC GGTTTTACTC CAGTAGACTT TAATATCACT
TATGACTGTG GTGATACTTC AAAGATTAAA AACTCGTTGC AAATGCGCAT CGACGGTACA
ACTGGGGTAG TAGACCAGTA CAACCTGGTC GCCAGACGAA GAAGTTCAGA CAATGCGCCC
GATGTCGGTA TTCGTATTGA AAATCTCGGC GGCGGAGTTG CAAATATTCC TTTTCAGAAC
GGTATCCTTC CCGTTGATCC TTCCGGGCAT GGCACCATCA ACATGCGCGC CTGGCCAGTT
AATCTGGTCG GTGGTGAGCT GGAAACAGGA AAATTTCAGG GCACCGCCAC CATTACCGTC
ATCGTGCGGT AA
 
Protein sequence
MKRAPLITGL LLISTSCAYA SSGGCGADST SGATNYSSVV DDVTVNQTDN VTGREFTSAT 
LSSTNWQYAC TCSAGKAVKL VYMVSPVLTT TGHQAGYYKL NDSLDIKTTL QANDIPGLVT
DQTVSVNTRF TQIKSNTVYS AATQTGVCQG DTSRYGPVNI GANTTFTLYV TKPFLGSMTI
PKTDIAVIKG AWVDGMGSPS TGDFHDLVKL SIQGNLTAPQ SCKINQGDVI KVNFGFINGQ
KFTTRNAMPD GFTPVDFNIT YDCGDTSKIK NSLQMRIDGT TGVVDQYNLV ARRRSSDNAP
DVGIRIENLG GGVANIPFQN GILPVDPSGH GTINMRAWPV NLVGGELETG KFQGTATITV
IVR