Gene P9303_03991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_03991 
Symbol 
ID4777025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp400178 
End bp402136 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content54% 
IMG OID640085902 
ProductABC transporter ATP-binding protein 
Protein accessionYP_001016416 
Protein GI124022109 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCTGA TCAGCCTGAT AGAGGCCTCA AAAGACTTCG GCATCAACAC CCTGTTTGCC 
GACCTCACCC TGCACATCAA CGAAAGAGAA CGTCTAGGGC TGATCGGACC GAATGGAGCC
GGGAAATCGA CGCTCCTGAA GGTGCTCGCC GGAGAAGAAC CCCTCGGAGA GGGTGAAAGG
AGGTGCTCAG CGCGATTACG CGTAGAGCTC GTGGGCCAAG AGAGCGCTGT AAACCCTGGC
CATACGGTGC TGCAAGAAGT CTTGGCCGGG TGCGGTGAAA AGAGAGAGCT GCTGCTGCGT
TTCAACGAGC TCAGCAATTC GATGGCCCGC AACCCCAACG ATGCAACGCT TTTGGCTGAG
TTAGGACAAG TCAGCCAGCG GATGGATGAC GCTCAAGCCT GGAGCTTGGA ACAGCAATGT
CAAGAGGTCC TTCAACGTCT TGGTATCACC GATCTAGAGC GGCCGGTTGA AGAACTCTCT
GGTGGGTATC GCAAGCGCGT CGGCCTTGCC TCCGCCTTGG TCGCCAGACC CGACGTTTTG
CTGCTCGATG AGCCCACCAA CCATCTCGAT GCGGCAGCAG TTGAGTGGCT ACAAAGTTGG
CTCGATCGCT TTCCAGGTGC TTTGGTGCTT GTCACCCACG ATCGCTACGT GCTCGATCGG
GTGACACGCC GCATGGTGGA AGTAGACCGA GGCAAAGCCC ATAACTATGC CGGCAACTAC
AGCACTTTTC TACAACAAAA GGCCGAACTT GAAGCTTCAG AAGCTTCCAC TGCTCACAAG
TTCAAAGGGG TTTTAAGACG AGAGCTGGCC TGGTTGCGAC AGGGTCCCAA AGCACGCAGT
ACCAAGCAGA AGGCACGCCT GCAGCGAATT GAGGAGATGC GCGCCGAACC ATTGCCGCAA
CTGCGGGGTT CCTTGAAGAT GGCGAACGTA AGCAGGCGTA TCGGCAAGCT AGTGATTGAG
GCGGAAGCCC TGCAAGTCAC CGCCAATGGC AAGCCCGACA GTCCTCTGCT TCTAGACAAC
TTCACCTACA GCTTTAGTCC AGAAGACCGC GTTGGCATCA TCGGCCCGAA TGGCAGTGGC
AAATCCACAT TGCTTGATTT AATTGCAGGC AGACGCCAAC CCAATGGTGG AACACTGCGA
TTAGGTGAAA CAGTGCACCT GGGATATCTC GACCAGCACA CCGAAGACAT CACCAAAGGC
AAGGGCCTAG ATCGCAAAGT GATCGATTTT GTTGAAGAAG CTGCCTCTCA AATCATTCTG
GGAGAAGAAC AAATTACGGC CTCTCAACTA CTGGAACGCT TCCTATTTCC ACCGGCTCAG
CAGCACAGCC CACTGGGGAA GCTCTCTGGG GGAGAACGAC GCCGACTCAC ACTCTGCCGG
ATGCTGATCC AAGCACCGAA TGTGCTGCTG CTTGATGAAC CCACCAACGA CCTAGATATC
CAGACACTAA GCGTGCTGGA AGATTTCCTT GAGGATTTCC GCGGCTGCGT TGTCGTGGTC
TCTCATGACC GCTATTTCCT CGATCGCACA GTTGATCGTC TGTTCAACTT TGAGAATGGA
CAATTGAAAC GTTTTGAGGG CAATTACAGC GCCTTTCTTG AACAACAACG ACGACAAGAG
CGAGACTTCA ACGAAGCAAA TGAGCCCAAA TCCAGCCTCC TTTTGCAGGA TTCTGGCCCA
TCTAGAATAT CCAAAAGATC CTCTCAAGAG GCAGAAAGCT CTTCACCCCA AGCCACAGAA
ACCAGCAAGC CAAGACGCCG AAGCTTCAAG GAATCACGTG AACTAGAAGC TCTAAACATT
GATATCCCAC TGCTTGAAGC CAAGCGTTCT AACCTTGAAG CTGCTTTGTC CGGTGGCGAC
GAAGACTTAA CTCTGCTTAG TCAACAATTG GCCGAGCTGG TCGAAACATT GCACAAAGCA
GAGGAGCGCT GGCTCGAACT CAGCGAACTG GCCATATAA
 
Protein sequence
MSLISLIEAS KDFGINTLFA DLTLHINERE RLGLIGPNGA GKSTLLKVLA GEEPLGEGER 
RCSARLRVEL VGQESAVNPG HTVLQEVLAG CGEKRELLLR FNELSNSMAR NPNDATLLAE
LGQVSQRMDD AQAWSLEQQC QEVLQRLGIT DLERPVEELS GGYRKRVGLA SALVARPDVL
LLDEPTNHLD AAAVEWLQSW LDRFPGALVL VTHDRYVLDR VTRRMVEVDR GKAHNYAGNY
STFLQQKAEL EASEASTAHK FKGVLRRELA WLRQGPKARS TKQKARLQRI EEMRAEPLPQ
LRGSLKMANV SRRIGKLVIE AEALQVTANG KPDSPLLLDN FTYSFSPEDR VGIIGPNGSG
KSTLLDLIAG RRQPNGGTLR LGETVHLGYL DQHTEDITKG KGLDRKVIDF VEEAASQIIL
GEEQITASQL LERFLFPPAQ QHSPLGKLSG GERRRLTLCR MLIQAPNVLL LDEPTNDLDI
QTLSVLEDFL EDFRGCVVVV SHDRYFLDRT VDRLFNFENG QLKRFEGNYS AFLEQQRRQE
RDFNEANEPK SSLLLQDSGP SRISKRSSQE AESSSPQATE TSKPRRRSFK ESRELEALNI
DIPLLEAKRS NLEAALSGGD EDLTLLSQQL AELVETLHKA EERWLELSEL AI