Gene Shewmr4_2234 details


Gene Information

Locus tag: Shewmr4_2234
Symbol:
ID: 4252806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2670121
End bp: 2671137
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 37%
IMG OID: 638118859
Product: fimbrial protein
Protein accession: YP_734363
Protein GI: 113970570
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3539] P pilus assembly protein, pilin FimA
TIGRFAM ID:
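
The length fields in this record can be cross-checked against the coordinates. The following is a minimal Python sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the last codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 2670121
end_bp = 2671137

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1017 bp, matching the Gene Length field
print(protein_length, "aa")  # 338 aa, matching the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.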


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.22557
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0194534
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTTTG CAATAAAGTT TCGTTTTTGT GTTTTTTGTT TACTTTTATT TTTCAGCGAA 
AAAATATTGG CATATTGTAC AAGTGTTGGG GCTCCTAATG TATATCTAAA TGCAAGTATT
AATATGGGTG ATGTATCTCA AGGGGATATG AGTGACTCTA GTGTATTTAC TTCTGATTGG
TATACAACTG GAGGAAATGG GTTGTACTGG ACCTTTTATG GTTGCTCTTC AGCAGGAAGT
TCACATTGGT TTTACTCCCC TAATTCGCAA ATAGAAACTA TTGATTCTAC CGCGATGTTT
GCTACTAATA ATCCGAATGT TTTTTATGAA CTATGGGTAA AAGATAGTGG TGGTCCTTGG
ATTAATTTGG CTACTATATC AAGTAGACAT GTTAATTTAC GTAATCATCC GGATGGTAAC
GGCTGGTACT CGGCACACTA CAAAACTCGA TTTCATGTAA GGGGAGTTAT CGATAAAGGA
ATTCATCAAG TTTATGGAGG ATTAGGGTAT GCATATTTAT CTGAGAATCA ATACTCAAAT
AATGGCTATG TAAGTCGGGA CGTTTCATAT TTATTTACAG TGAATGTCAA CCCAGGTACA
TGTTCTTTTT CAACGGAAGA TGATAATAAG GTTGTTAATT TGCCAACTGT GAATGTTGGT
GATTTTAATG GTGTTGGAAC TGGCGCTGGG GAAAAAAACT TTAACCTGAA TGTGACTTGT
CAAAGTGGAA CAAAAGCTTA TATCAGTTTT TCAGATGCTT ATAACTCGAG TAACTCTAGC
GATTCTCTAT CTACATTGAC CTCAACGACA TCTGCCGATG GGGTTAATAT ACAGATTCTA
TCGAACAACT GGGGGGGGGC AATTGTACTA AATACACCAT TTTATGTTGT TGGTGCTGAT
GACTCGGCCG CTGCGAGTTA CACCATTCCA TTTTCAGCTA GATATATACA AGTGGACCCA
GTTTTGACTG CAGGAACAGT TGAGGCAAAA ACAATAGTTG TTATGACTTA TGAATAG
 
Protein sequence
MFFAIKFRFC VFCLLLFFSE KILAYCTSVG APNVYLNASI NMGDVSQGDM SDSSVFTSDW 
YTTGGNGLYW TFYGCSSAGS SHWFYSPNSQ IETIDSTAMF ATNNPNVFYE LWVKDSGGPW
INLATISSRH VNLRNHPDGN GWYSAHYKTR FHVRGVIDKG IHQVYGGLGY AYLSENQYSN
NGYVSRDVSY LFTVNVNPGT CSFSTEDDNK VVNLPTVNVG DFNGVGTGAG EKNFNLNVTC
QSGTKAYISF SDAYNSSNSS DSLSTLTSTT SADGVNIQIL SNNWGGAIVL NTPFYVVGAD
DSAAASYTIP FSARYIQVDP VLTAGTVEAK TIVVMTYE
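
Both sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before they can be measured or compared. The following is a minimal Python sketch of that step, with a GC percentage calculation and a length consistency check. The function names are hypothetical, and the toy input is made-up data rather than this gene. The result for the real sequence can be compared against the GC content and length fields in the record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string with no whitespace."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def is_consistent(dna: str, protein: str) -> bool:
    """Check that the DNA holds one codon per residue plus a final stop codon."""
    return len(dna) == 3 * (len(protein) + 1)


# Toy input in the same block layout as the record (hypothetical data).
toy_dna = clean_sequence("ATGGCC\nTAA")
toy_protein = clean_sequence("MA")
print(len(toy_dna), gc_percent(toy_dna), is_consistent(toy_dna, toy_protein))
```

To apply this to the record, paste the gene sequence and protein sequence blocks into two strings and pass them through `clean_sequence` first. The consistency check assumes the coding sequence ends in a single stop codon.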