Gene P9303_21681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_21681 
SymbolalsT 
ID4777499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1923479 
End bp1924873 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content55% 
IMG OID640087678 
ProductSodium:alanine symporter family protein 
Protein accessionYP_001018168 
Protein GI124023861 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1115] Na+/alanine symporter 
TIGRFAM ID[TIGR00835] amino acid carrier protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0888234 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTGT TTCCAGCGTC GCTCTTGACG AATTTATCCT TACAACTCGG ACAGTTCCCA 
AGCGGACTGG AAGACGCAGT CGAGGCCATC AATAATCCGA TCAACGGCTT TGCCTGGGGT
TGGCCCACAG TCATCCTGAT TGCAGGCACC GGCATCTTGC TGATGGTTGG GCTGGGCTTT
ATGCCCTTAC TGCGCATCCC CTATGGCGTG CGCATGCTGC TTCGCAATCC AACCTCCTCT
AGCGAGGGCG AAATCAGTCC ATTCCAGGCA CTGATGACCT CGATGGCGGC CACGATCGGC
ACGGGCAACA TCGCCGGTGT CGCTGTTGCG ATCGCCATGG GCGGCCCTGG GGCGGTGTTC
TGGATGTGGC TAATTGCCAT TTTTGGTATT GCCACCAAGT ACGCCGAAGC CTTACTTGCA
GTTCACTTCC GCGAAGTGGA CCCCCTCGGC AATCATGTCG GTGGTCCGAT GTACTACATC
CGCAATGGCC TAGGTCCAAA CTGGGCCTGG CTGGGCGGAT TCTTTGCCCT GTTTGGAATG
CTGGCGGGCT TTGGCATTGG CAATGGCGTG CAATCGTTTG AGGTCTCCAG TGCCTTAGCC
ACGATCGGCA TCCCTCGGCT TTTAACGGGT GTCGTGCTTG GAGTGCTTGT CTTTGGGGTC
ATCATTGGCG GCATCAAACG CATCGCCCAG GCTGCATCCG CCATCGTTCC TTTGATGTCG
TTGTTTTATG TGATTGCTTG CCTGGTCATC ATTCTCAGCA ACATCAGCGA AGTGCCAGCA
GCGTTCTCAA CGATCTTCTC TAATGCCTTC ACAGGCGAAG CCGCTGCCAG CGGCACGTTG
ACCCAAGTGA TCCTGATGGG CTTCAAGCGC GGCATCTTCT CCAATGAAGC TGGTCTCGGT
AGTGCGCCAA TAGCTCACGC TGCCGCCAAC ACCAATGACC CAGTGCGTCA GGGCACTATC
GCCATGCTTG GAACCTTCAT CGATACTTTG ATCATCTGCA CAATGACGGC TCTGGTGATC
ATCACCACCG GTGCCTATCA GAGTGGTGAG TCAGGCTCTG ATCTATCAAT CGCTGCCTTC
AACAGTGGCC TTGCAGGCTC AGGTTGGGTC GTGACAGCTG GCCTCGTGGT GTTTGCGCTA
ACAACAGTTC TTGGCTGGGG CTTTTACAGC GAACGCTGCA CTGAATATCT CTTTGGGGTG
CAAGCCATTC TCCCCTTCCG CCTGGTGTGG GTCGCTGTAG TTGTCATTGG TGCTGTTGCA
GGCAATCGCG GCGTGGTGTG GGACGTAGCT GACACACTTA ATGGTCTGAT GGCGATTCCT
AACTTGATCG CACTGGTGCT GCTCTCAGGC ACTGTCTTCC GCCTCTCCAA AAACTACCGA
TTTGAAGAGG ACTAA
 
Protein sequence
MDVFPASLLT NLSLQLGQFP SGLEDAVEAI NNPINGFAWG WPTVILIAGT GILLMVGLGF 
MPLLRIPYGV RMLLRNPTSS SEGEISPFQA LMTSMAATIG TGNIAGVAVA IAMGGPGAVF
WMWLIAIFGI ATKYAEALLA VHFREVDPLG NHVGGPMYYI RNGLGPNWAW LGGFFALFGM
LAGFGIGNGV QSFEVSSALA TIGIPRLLTG VVLGVLVFGV IIGGIKRIAQ AASAIVPLMS
LFYVIACLVI ILSNISEVPA AFSTIFSNAF TGEAAASGTL TQVILMGFKR GIFSNEAGLG
SAPIAHAAAN TNDPVRQGTI AMLGTFIDTL IICTMTALVI ITTGAYQSGE SGSDLSIAAF
NSGLAGSGWV VTAGLVVFAL TTVLGWGFYS ERCTEYLFGV QAILPFRLVW VAVVVIGAVA
GNRGVVWDVA DTLNGLMAIP NLIALVLLSG TVFRLSKNYR FEED