Gene A9601_07001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_07001 
Symbol 
ID4717403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp623058 
End bp624623 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content35% 
IMG OID640078413 
Productputative sulfate transporter 
Protein accessionYP_001009093 
Protein GI123968235 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAATT TCTCAAGATA TTTATCTAAA AATTGGTTAG ATGATCCAAA GTCAAATATT 
CTCTCTGGCT TAGTTGTAGC TTTTGCAATG ATCCCAGAAG CAATTGCTTT TTCAGGTATA
GCTGGTGTAG ATCCTAAAGT TGGCCTTTTT GGTGCATTTT GCCTATCTAT AACGATTGCC
ATTGTTGGAG GTAGAAGGGG GATGATCACT TCAGCCACAG GTTCAACAGC TCTTTTAATG
ACTGGACTTG TTGCTTATGG AGAATCACAA GCTCCTGGAT TAGGAGTCCC ATATCTCATT
GCAGCTGGAA TATTAACTGG AATTTTCCAA ATTCTTTGGG GATACTTAAG GCTTGCCTAC
CAAATGCGAT TCGTTCCAAC AGGAGTATTA AGTGGATTTG TAAATGCATT GGCACTTTTA
ATATTTCAAG CACAACTACC TCAGTTAGGA ATAGGGATTA AAGAATCAAA AGAATTAGTT
GAACAAACTA TCAGTCAATA TCCAATTAAC TCTCAGATTC CAGTAGTTTG GATTCTTGTA
ATCCTAGGAT TAGTAATTAT CTATGGACTT CCAAAAATCA CAAAAGTAGT ACCATCGCAA
CTTATCGCAA TAGTAGTAAT TACTCTTATA AGCATATTTT TTAATCTAGA TGTCCCAACA
GTTAGCGATT TGGGTAAATT ACCTGATGGA TTGCCAAGTA TTTCTCTTCC TTTTGGATCA
ATAGAAAATG GGAAAGTACC TTTTAGTCTT GAAACATTAG GGATAATTTT ACCGACTTCA
CTTGCAATAT CTCTCGTAGG TTTAATGGAA ACCTTTTTAA CTCAAGACAT TTTAGACGAT
GTAACTGATA CAAGTTCTAA TAAAAATAAA GAAGCAAGAG GACAGGGAAT CGCAAATATT
GTGGCATCCT TATTTGGTGG AATGGCAGGA TGTGCCTTAG TTGGGCAATC TGTAATGAAT
ACTGAGAATG GTGGTAAATC TAGATTATCA ACCCTCTCCT CAGGTATATC TCTACTAATT
ATGATTATCC TCTTGAAGTC TTGGATTGGA GCAATCCCAA TGGCGGCTTT AGTAGCAATC
ATGATAACGA TTGCAATAAG TACAGCAGAT ATAAATGGAT TAAAAAATAT TAGAAAGATA
CCTAAAAGCG ATACTGCAGT CATGCTTATG ACTTTCTCAG TTACTATGCT TACAAAACCT
CATAATCTTG CGCTTGGAGT TATTGCAGGA GTTGCATTAG CTGCAATTCT TTTCAGCAGA
AAAGTTGCAA AAGTTATAAC TGTCTCAGGA GCAAAAGAAA ATAATTTAAC TACCTATAAA
GTAAAAGGAC AATTATTTTT TGTAAGTAAA ATTTATTTTT TACAAGGATT TGATATTCAT
GAACATCCAG AAAATATTGT AATTGACATG TCTTTAGCTC ATATTTGGGA TCAAAGTGGC
GTTGTTGCTC TTGAGCAAAT TATTAGAAAA TTCCAGAATG GTGGATCTAA AGTTGAAATT
GTAGGATTAA ATAAAGAAAG TCTTAACCTA TTTGAAAGAC TAGGTGGTAT TGAAAGCGCT
CATTGA
 
Protein sequence
MSNFSRYLSK NWLDDPKSNI LSGLVVAFAM IPEAIAFSGI AGVDPKVGLF GAFCLSITIA 
IVGGRRGMIT SATGSTALLM TGLVAYGESQ APGLGVPYLI AAGILTGIFQ ILWGYLRLAY
QMRFVPTGVL SGFVNALALL IFQAQLPQLG IGIKESKELV EQTISQYPIN SQIPVVWILV
ILGLVIIYGL PKITKVVPSQ LIAIVVITLI SIFFNLDVPT VSDLGKLPDG LPSISLPFGS
IENGKVPFSL ETLGIILPTS LAISLVGLME TFLTQDILDD VTDTSSNKNK EARGQGIANI
VASLFGGMAG CALVGQSVMN TENGGKSRLS TLSSGISLLI MIILLKSWIG AIPMAALVAI
MITIAISTAD INGLKNIRKI PKSDTAVMLM TFSVTMLTKP HNLALGVIAG VALAAILFSR
KVAKVITVSG AKENNLTTYK VKGQLFFVSK IYFLQGFDIH EHPENIVIDM SLAHIWDQSG
VVALEQIIRK FQNGGSKVEI VGLNKESLNL FERLGGIESA H