Gene P9303_24401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_24401 
Symbol 
ID4777920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2145924 
End bp2147234 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content56% 
IMG OID640087960 
Productbicarbonate transporter, ICT family protein 
Protein accessionYP_001018436 
Protein GI124024129 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID[TIGR00947] probable bicarbonate transporter, IctB family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAGA CTGCGGTCCC AAAGCCCCTT CTACTGCGCT GGCAGGGACG CATTCCCTCC 
TCTGAGGCGA TGCAAATGCG CCTGCAGTGG ATTGCGGGGT TGCTGTTGAT GATGCTCCTA
GCAACCCTGC CCATGCTGAC TCGAACAGGG CTGGGACTAA CAATCCTCGC CGCCGGAGCG
TTATGGATCA TCTGGGGCTG CGTGACACCA GCTGGCCGAA TTGGAAGCAT CAGTAGCTGG
TTACTTGTGT TTCTCGCTAT TGCATTGCTC GCCACAGGAT TCTCACCCGT TCCATTGGCA
GCTGCCAAAG GATTGATCAA ACTCATCAGC TACCTGGGGG TGTACGCACT GATGCGGCAG
CTACTAGCCA CAAGGAGCGA CTGGTGGGAT CGCCTGGTGG CTGCCCTACT AACCGGCGAA
CTGATCTCTT CTGTGATCGC AATCAGGCAG CTCTATGCCC CCGCTGAGGA AATGGCCCAC
TGGGCAGATC CCAATTCAGT GGCTGCAGGG ACAGTGCGAA TTTATGGTCC GCTTGGTAAT
CCCAACCTGC TAGCCGGCTA TCTCATACCC ATCCTGCCGC TGGCCTTAGT AGCCCTACTG
AGATGGCAAG GCTTGGGGGC AAAGCTTTAC GCGATGGTCG CTCTAGGGCT TGGCATCACA
GCAACCCTAT TCAGCTTCAG CCGCGGTGGA TGGCTAGGCA TGCTTTCCGC TCTAGCTGTG
ATTTTGGTGC TGCTGCTGTT GCGCAGTACC AGCCACTGGC CTCTCGTCTG GCGTCGTCTG
CTGCCCCTAA TCGTGATTGT TTTGGGCACA GCCATGCTGG TGATAGCAGC AACCCAGATT
GAGCCCATCC GCACCCGAAT CACAAGCTTG ATCGCAGGGC GAAGTGACAG CTCTAACAAC
TTCCGCATCA ACGTTTGGCT ATCGAGCCTT GAAATGATTC AGGCACGCCC ATGGCTGGGT
ATTGGCCCTG GCAACGCTGC CTTCAACAGG ATCTATCCGC TCTTTCAACA GCCCAAATTC
AACGCCCTAA GTGCCTACTC TGTTCCCCTG GAAATCCTTG TCGAAACCGG ACTGCCTGGC
CTCATTGCAA GTCTCGCTCT AGTAATCACC AGCATCCGCA AGGGCCTCGC TGGCCTCAAC
TCAAACAATC CGCTGGCCCT CCCCGCTCTG GCAAGCCTGG CCGCCATGGC TGGGCTTGCG
GTTCATGGCA TCACAGATAC CATTTTTTTT CGACCTGAGG TTCAACTCGT GGGCTGGTTC
TGCCTCGCCA CACTGGCCCA AACACAGCCA GAACAAAAGC AACTCCAATA G
 
Protein sequence
MPKTAVPKPL LLRWQGRIPS SEAMQMRLQW IAGLLLMMLL ATLPMLTRTG LGLTILAAGA 
LWIIWGCVTP AGRIGSISSW LLVFLAIALL ATGFSPVPLA AAKGLIKLIS YLGVYALMRQ
LLATRSDWWD RLVAALLTGE LISSVIAIRQ LYAPAEEMAH WADPNSVAAG TVRIYGPLGN
PNLLAGYLIP ILPLALVALL RWQGLGAKLY AMVALGLGIT ATLFSFSRGG WLGMLSALAV
ILVLLLLRST SHWPLVWRRL LPLIVIVLGT AMLVIAATQI EPIRTRITSL IAGRSDSSNN
FRINVWLSSL EMIQARPWLG IGPGNAAFNR IYPLFQQPKF NALSAYSVPL EILVETGLPG
LIASLALVIT SIRKGLAGLN SNNPLALPAL ASLAAMAGLA VHGITDTIFF RPEVQLVGWF
CLATLAQTQP EQKQLQ