Gene Ssed_3938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_3938 
Symbol 
ID5613444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp4810365 
End bp4811543 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content51% 
IMG OID640934892 
Productaromatic amino acid permease 
Protein accessionYP_001475670 
Protein GI157377070 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID[TIGR00837] aromatic amino acid transport protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.176846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.472237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGGCT CTATCGCTAT CGTTGCTGGT ACCGCCATCG GTGCCGGAAT GTTAGCGCTG 
CCTCTGGCCA CCGCTGCTCT TGGGATCATT CCTGCCATAC TCTTACTCGT TCTCATTTGG
GGCGTCTCTG CCTACACCTC ACTACTGATG CTGGAGATTA ATCTTCGTAC CGGCGTGGGT
GATAATGTTC ATGCCATTAC CGGTAAAGTT TTGGGTAAGA GAGGTCAGCT TGTTCAGGGG
GCTTCCTTCT TAAGCCTACT GTTTGCCCTG ACTGCGGCAT ATCTAACCGG TGGTTCATCT
CTATTGGTTC TGAAAGCCGA AACCATGTTC GATGTCGCGT TAGATAACCA AGCGGCAGTG
ATCTTATTCA CTCTGACGCT TGGACTGTTT GCCGCCTTCG GTGTGGCCTG GGTCGATAAG
GTTTCCCGTA TCCTCTTCTC TTTGATGATT GCGCTTTTGG TCGTCACCGT TGGATTCTTA
ATGCCTGAAG TTAGCCCATC TAAGATGGCG GTTGCCGCGA TGGAAAGGGG TAACTTCGAT
GTCTGGATGG CAGCTATCCC TGTGGTATTT ACCTCATTTG GTTTTCATGT CTGTATCGCA
ACCTTAGTGC GCTATCTTGA CGGCGATACT ATGACACTGC GTAAGGTGCT CTTGATTGGC
TCGACGCTGC CTCTGCTATG TTATGTCTTA TGGTTATTGG TAACACTGGG TACCGTTGGC
GGTGAAGCCA TTCATGGGTT TGGTGGTTCA TTGCCGGCAC TGGTGAGTTC TCTGCAGGAT
ATTGCGGCTC AGCCATGGGT GAGCAGGTGT ATCTCGCTAT TTGCAGACTT TGCCTTAGTG
ACCTCTTTCC TCGGTGTGAC CCTAAGCCTA TATGACTTCA TCGGAGAGTT AACCCGCGCC
AGACCTACCA TTGCGGGTCG TATTCAAACC TGGTTGATCA CCTTCATCCC TCCGGTCTTG
TGTGCACTCT ATATTCCGGA AGGATTCGTT GCGGTATTGG GTTTCGCGGC AATTCCTTTG
GTCGTGATGA TCATCTTCCT GCCAATAGTG ATGGCGTTAA AGCAACGCCC ACAGGCAACA
GAGAATGGGT ATCAGGTAGC GGGTGGAACT CCTGCGCTAG CCATAGCCGG CGTGTTAGGC
ACAGTGATTA TTGCCTCACA ACTTTGGGTC GCTCTATAG
 
Protein sequence
MLGSIAIVAG TAIGAGMLAL PLATAALGII PAILLLVLIW GVSAYTSLLM LEINLRTGVG 
DNVHAITGKV LGKRGQLVQG ASFLSLLFAL TAAYLTGGSS LLVLKAETMF DVALDNQAAV
ILFTLTLGLF AAFGVAWVDK VSRILFSLMI ALLVVTVGFL MPEVSPSKMA VAAMERGNFD
VWMAAIPVVF TSFGFHVCIA TLVRYLDGDT MTLRKVLLIG STLPLLCYVL WLLVTLGTVG
GEAIHGFGGS LPALVSSLQD IAAQPWVSRC ISLFADFALV TSFLGVTLSL YDFIGELTRA
RPTIAGRIQT WLITFIPPVL CALYIPEGFV AVLGFAAIPL VVMIIFLPIV MALKQRPQAT
ENGYQVAGGT PALAIAGVLG TVIIASQLWV AL