Gene Shewmr4_3787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3787 
Symbol 
ID4254350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp4523295 
End bp4524551 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content49% 
IMG OID638120432 
Productaromatic amino acid transporter 
Protein accessionYP_735907 
Protein GI113972114 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID[TIGR00837] aromatic amino acid transport protein 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAACC ATATGAATGC GGCGAAGAAT AAGCCGGTGG GCAAGTCTTT GCTCGGTGGT 
GCCATGATTA TTGCCGGCAC CACAGTCGGG GCGGGGATGT TTTCTTTGCC CGTTGTGGGC
GCTGGCATGT GGTTTGGCTA CTCGGTACTG ATGTTACTCG GTATTTGGTT CTGCATGTTG
ATGTCGGGAT TGTTACTGCT CGAAACCAAC CTGCATTTCG AACCGGGAGC GAGTTTCGAT
ACCCTGACCA AGGAAACTCT AGGTCAGTTT TGGCGGATCG TGAATGGGGT ATCGATTGCC
TTCGTGCTTT ACATCTTAAC CTACGCCTAC ATCAGTGGCG GCGGCTCGAT TGTTAACCAT
AGCCTACAGG GTATGGGGAT TGAGTTACCC CAGAGTGTTG CTGGATTAGT ATTTGCGGTT
GTGCTGGCTT GTATCGTGTT GATCAGTACC AAGGCGGTGG ACCGTATTAC GACCATAATG
CTCGGCGGTA TGATTATTAC TTTCTTCCTC GCGATTGGTA ATTTACTGAT CGAGATTGAT
GTGACTAAGT TGCTCGAACC GGATGGCAAT CAAAGCTTTA TGCCTTATCT GTGGGCTGCT
TTGCCCTTTG GCTTGGCAAG TTTTGGTTAC CATGGCAATG TGCCGAGTCT GGTGAAGTAC
TATGGCAAAG ATTCATCGAC CATCATCAAA GCGATTTTCG TCGGCACTTT CATTGCGCTG
ATCATTTATG CCTGCTGGTT AGTCGCCACT ATGGGCAACA TTCCCCGCAG CCAATTTAGT
GAGATTATTG CCCAGGGTGG CAATATGGGT GTGTTAGTCG GTGCCTTGTC GAAAGTGATG
GAAAGTAGCT GGTTAAACAG CATGTTAACC CTGTTTGCTA ACTTGGCTGT GGCGTCTTCA
TTCCTTGGCG TGACCTTAGG CTTATTCGAT TATTTGGCCG ATTTATTCGG GTTCGACGAT
ACGCGCTCGG GGCGGATGAA AACGGCAATT GTCACCTTTG TGCCGCCGAC CATATTAGGT
TTGCTGTTCC CCGATGGTTT TTTGGTTGCC ATTGGTTTCG CAGCCTTAGC GGCAACCGTT
TGGGCCGTCA TAGTGCCCGC TCTGATGGCC TATAAATCGA GACAGCAGTT CCCTAACCAT
CAAGGTTTTA GAGTCTTTGG TGGCACGCCC TTAATTATCC TTGTGGTGCT ATATGGCGTG
GTGACAGGGG CTTGTCATCT ACTCGCGATG GCGAATTTAT TGCCGCAGTT CTCCTAA
 
Protein sequence
MANHMNAAKN KPVGKSLLGG AMIIAGTTVG AGMFSLPVVG AGMWFGYSVL MLLGIWFCML 
MSGLLLLETN LHFEPGASFD TLTKETLGQF WRIVNGVSIA FVLYILTYAY ISGGGSIVNH
SLQGMGIELP QSVAGLVFAV VLACIVLIST KAVDRITTIM LGGMIITFFL AIGNLLIEID
VTKLLEPDGN QSFMPYLWAA LPFGLASFGY HGNVPSLVKY YGKDSSTIIK AIFVGTFIAL
IIYACWLVAT MGNIPRSQFS EIIAQGGNMG VLVGALSKVM ESSWLNSMLT LFANLAVASS
FLGVTLGLFD YLADLFGFDD TRSGRMKTAI VTFVPPTILG LLFPDGFLVA IGFAALAATV
WAVIVPALMA YKSRQQFPNH QGFRVFGGTP LIILVVLYGV VTGACHLLAM ANLLPQFS