Gene Shewmr4_1792 details


Gene Information

Locus tag: Shewmr4_1792
Symbol:
ID: 4252366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2127505
End bp: 2128692
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 52%
IMG OID: 638118403
Product: aromatic amino acid transporter
Protein accession: YP_733923
Protein GI: 113970130
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0814] Amino acid permeases
TIGRFAM ID: [TIGR00837] aromatic amino acid transport protein
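
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration and rests on two assumptions the record does not state: that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminating stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2127505
end_bp = 2128692
gene_length_bp = 1188
protein_length_aa = 395

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions, all three checks agree with the values listed for this gene.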


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAAA ATAAATTTTT CGGTAGTTTG CTACTGATTG CAGGCACCAC CATTGGCGCG 
GGTATGCTCG CACTCCCTAT CGCCTCGGCA GGACTCGGTT TTGGTGTATC GAGCATCATT
ATGTTGCTCC TCTGGGCGCT GATGGCCTAC ACCGCCCTGC TGATGGTTGA AATCCATCAA
TTTGCCCCGA GTGATGCGAG CCTGAACCAA TTAGCGCGCA CGCTTTTGGG CGCTAAGGGC
CAAGTGATTG CCAGTGTTGC CCTGATGTTT TTACTCTACG CCCTGTGCGC CGCCTATATC
GCGGGTGGCG GCGAGCAAGT CAATCAAAAG CTCAATGCTT GGTTAGGATT AAATCTTCCG
CCACAGGCGG GCGCCATCTT CTTTACCCTG TTAGTCAGCA CCATTGTCGG CTTAGGCACC
CATTGTGTCG ATTTGATTAA TCGCGTGCTC TTTAGTTTGA AAATCATTGC GTTAATCCTA
ATGCTGGCTT TATTACTGCC ACAGGTTGAA GGCACACATT TACTCGAACT GCCGCTAGAG
CAAGGGCTTA TCGTGTCAGC CATACCTGTG ATTTTTACCT CCTTTGGTTT TCATGGTTCG
ATTCCATCCG TGGTGCGCTA CTTAGGCGTT GAGGTAAAAA GCCTGCGTAA AATCATGCTG
CTCGGCTCGG CGTTACCACT GCTCATTTAC CTGCTGTGGC AACTGGGCAG TCAAGGCGTA
CTCAGTCAAA GCCAACTGAT GACGAATCAG AGTCTTTCGG GCTTTATCAA TCAGTTAGCC
AGTGTATTGC ACAGCCAATA CTTAAGTTCT GCCATCAGTG TATTTGCCGA TCTGGCGCTG
GCCACCTCCT TTTTAGGGGT GAGCCTCGGT CTGTTTGACT TTATGGCGGC TAACTTAAGG
CAGCAGGATA ATGCCGTGGG TCGCAGTGTT ACCGCCGCCA TTACCTTCGT ACCTCCTCTG
GGGTTTGCCC TCTTTTACCC GCAGGGATTT ATTACCGCCC TCGGTTATGC GGCAATCGCC
CTCGTGATCC TCGCGATTTT TTTACCCGTG ACCATGGTGT GGGTTCAAAG ACAAACGCGC
GATAAGGCGA ATCTGCCACA GGGTTACCGC GTCGCAGGGG GGAAGCTCGG TTTACTGTTG
GCAATGCTCT GCGGAGTGGC CGTGATTGGC GCTCAGCTCT TGGGATAA
 
Protein sequence
MTQNKFFGSL LLIAGTTIGA GMLALPIASA GLGFGVSSII MLLLWALMAY TALLMVEIHQ 
FAPSDASLNQ LARTLLGAKG QVIASVALMF LLYALCAAYI AGGGEQVNQK LNAWLGLNLP
PQAGAIFFTL LVSTIVGLGT HCVDLINRVL FSLKIIALIL MLALLLPQVE GTHLLELPLE
QGLIVSAIPV IFTSFGFHGS IPSVVRYLGV EVKSLRKIML LGSALPLLIY LLWQLGSQGV
LSQSQLMTNQ SLSGFINQLA SVLHSQYLSS AISVFADLAL ATSFLGVSLG LFDFMAANLR
QQDNAVGRSV TAAITFVPPL GFALFYPQGF ITALGYAAIA LVILAIFLPV TMVWVQRQTR
DKANLPQGYR VAGGKLGLLL AMLCGVAVIG AQLLG
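
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, scored for GC content, and translated. It builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which this sketch does not model. The function names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# The amino acid assignments are those of the standard code, which
# translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGACACAAA ATAAATTTTT CGGTAGTTTG CTACTGATTG CAGGCACCAC CATTGGCGCG"
last_codons = "GCTCAGCTCT TGGGATAA"

head_peptide = translate(clean(first_line))
tail_peptide = translate(clean(last_codons))
print(head_peptide)
print(tail_peptide)
print(round(gc_fraction(clean(first_line)), 2))
```

The two peptides printed here correspond to the first twenty and last five residues of the protein sequence listed above. Passing the full gene sequence through `clean` and `translate` would be the natural way to compare against the complete protein.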