Gene Shewmr4_3026 details


Gene Information

Locus tag: Shewmr4_3026
Symbol:
ID: 4253597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 3618198
End bp: 3619373
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 51%
IMG OID: 638119668
Product: aromatic amino acid permease
Protein accession: YP_735154
Protein GI: 113971361
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0814] Amino acid permeases
TIGRFAM ID: [TIGR00837] aromatic amino acid transport protein
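
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3618198
END_BP = 3619373


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: all codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1176
print(protein_length(length_bp))  # 391
```

Both results match the listed Gene Length (1176 bp) and Protein Length (391 aa).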


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAGGCT CGATAGCCAT TGTCGCGGGG ACCGCCATTG GCGCGGGAAT GTTAGCCTTA 
CCCTTAGCCA CGGCCGCCTT AGGCATGGTG CCAGCCATTT TGTTAATGGT GGTGATTTGG
GGCTTGTCAG CCTATACCTC ATTGTTAATG CTTGAGATTA ACCTGCGCTC AGGCGTTGGT
GATAACGTCC ACGCCATCAC GGGCAAACTC CTCGGCAAGA AAGGCCAAAT GGTGCAAGGC
GCCTCCTTTC TCAGTTTACT CTTTGCCTTA ACGGCGGCGT ATTTGACGGG CGGTTCATCG
CTGTTAGTGC TTAAAGCCAA AAATATGTTC GACCTCGTGT TAGATAACCA ACTGGCGGTC
GTGCTGTTTA CCTTAGTGCT GGGTGGATTT GCGGCCTTAG GAGTCGCTTG GGTTGATAAA
GCCTCGCGCT TCTTGTTTTC GCTGATGATT TTATTGCTGA TTGTGGTCGT GCTGTTTTTA
TTACCGGAAG TCAGTATCTC GAGTATGGCA ACCAGTGCAG TGGCCGAGTC CATGACCAGC
AGTTGGATGG CGGCGATTCC TGTGGTGTTT ACTTCTTTTG GTTTCCACGT GTGTATCGCC
ACCTTAGTGC GTTATTTGGA TGGCAATGCT GTTTCGCTGC GCAAAGTATT ATTAATCGGT
TCAACCATTC CGCTCGCTTG TTATATCTTC TGGTTATTGG TGACCTTAGG CACAGTGGGT
GGCAACGAAG TTAGCAGCTT TAATGGCTCT TTACCTGCGC TGATCAGTGC ATTACAAGAG
ATTGCCCACA CGCCTTGGAT CAGCAAATGT ATTTCGCTGT TTGCGGATTT AGCCTTAATC
ACCTCTTTCC TCGGGGTCAC CTTAAGCCTG TATGATTTTG TGGCCGAACT GACCCGCGCA
AAGAAGACCT TCCTCGGCCG CGCCCAAACC TGGCTGTTAA CCTTTGTGCC GCCGCTGTTA
TGTGCGCTCT ATGTCCCCGA AGGTTTTGTT GCGGTATTAG GCTTTGCAGC CGTGCCGCTG
GTGGTGATGA TTATCTTCCT GCCGATCGTG ATGGCACTGC GTCAGCGCCA AGCCACGCCG
CAGGGATACC AAGTGTCTGG CGGCACATTT GCCCTCGGAA TTGCGGGTTT GCTAGGCGCA
GTGATTATCG GCGCTCAGTT ATTTGTCGCG CTGTAA
 
Protein sequence
MLGSIAIVAG TAIGAGMLAL PLATAALGMV PAILLMVVIW GLSAYTSLLM LEINLRSGVG 
DNVHAITGKL LGKKGQMVQG ASFLSLLFAL TAAYLTGGSS LLVLKAKNMF DLVLDNQLAV
VLFTLVLGGF AALGVAWVDK ASRFLFSLMI LLLIVVVLFL LPEVSISSMA TSAVAESMTS
SWMAAIPVVF TSFGFHVCIA TLVRYLDGNA VSLRKVLLIG STIPLACYIF WLLVTLGTVG
GNEVSSFNGS LPALISALQE IAHTPWISKC ISLFADLALI TSFLGVTLSL YDFVAELTRA
KKTFLGRAQT WLLTFVPPLL CALYVPEGFV AVLGFAAVPL VVMIIFLPIV MALRQRQATP
QGYQVSGGTF ALGIAGLLGA VIIGAQLFVA L
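
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may act as starts), and it simply stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard codon assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGTTAGGCT CGATAGCCAT TGTCGCGGGG"))  # MLGSIAIVAG
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function should yield a string that can be compared against the protein sequence after removing its spaces.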