Gene P9301_08861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_08861 
Symbol 
ID4911233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp762087 
End bp763364 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content41% 
IMG OID640160468 
Productputative urea ABC transporter, substrate binding protein 
Protein accessionYP_001091110 
Protein GI126696224 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTT CAAGGCGTAT TTTGGCAGGT TTAGCTACTG CCTCACTTGC AGTCACCGCA 
ACTTCGTGTG GTGGAGGCGG AACCTCCGGA AGTTTCGATG ACACAGTAAC CGTTGGTATT
TTGCATTCGT TATCCGGAAC AATGGCTATC TCTGAATCAA CTCTTGTTGA TACAGAAAAA
ATGGCTATTG AAGAGATAAA TGCTGCTGGT GGCGTTAAAG TTGGCGGCAA AAGCTACAAA
ATAGAATACA TAGTTGAAGA TGGTGCATCT GACTGGCCTA CCTTTGCTGA AAAATCAAAG
AAACTTATAG ACCAAGACGG AGTTCCTGTC GTATTTGGTG GATGGACATC TGCAAGTAGA
AAGGCAATGT TACCTGTCTA CGAATCAAAA GATGCGTTCC TCTACTACCC GATTCAATAT
GAAGCCCAAG AATGTTCTAA TAACATTTTT TATACAGGAG CCACACCAAA CCAACAATCG
GAACCTGCTA CAGATTTCAT GTATAAGCGT TCTCCCGCAG CTGGTGGAGA TTTCTTCCTT
GTAGGTTCTG ATTATGTTTT CCCAAGAACT TCAAATACAA TCACAAAAGC CCAGGTAAAA
CAGTTAGGCG GCAAAGTTGT TGGAGAAGAT TACCTTCCAT TAGGAAATAC TGAAGTTGCT
CCAATTATCT CAAAAATCAA AAAGGCGCTT CCTGAAGGTG GAATAATCAT TAATACACTT
AATGGTGACC AAAACGTTGC ATTCTTCAAA CAGATTCAAG ACGCTGGTAT CACACCTTCG
AGTGGCTACT ACGTAATGAA CTATTCAATT GCTGAAGAAG AGATTAGTAC AATTGGACCT
GAGTTCCTTG AAGGTCACTA TGGTGCTTGG AACTACATGA TGTCAATTGA TACTCCTGCA
TCAAAGAAAT TTGCTAAAAG TTTCAAGAAA AGATGGGGAG CAGATCGTGT TGTAGCTGAT
CCTCAAGAAT CTGCTTACAA CATGGTTTAC TTATGGAAAC AAGCAGTTGA AGATGCTGGA
ACTTTCGACG ACAACGCAGT AAGGGAAGCC CTAGTTGGAC AAAAGTTTGA TGCCCCACAA
GGTCCAGTTG AAGTTATGCC AAACCATCAC TTATCTCAAA CTGTGAGAAT CGGAGAGATT
AATGCAGAAG GTGGCTTTAC AATTCTTGAA GAGACAGGAG TTGTTCTACC TCAAGCATGG
AACCAAAAGC ATCCAAGCTC AAAAGGATTT GCATGCGATT GGACAGATCC TTCAAAAGGA
GAAAAGTATA AGCTTTAA
 
Protein sequence
MRISRRILAG LATASLAVTA TSCGGGGTSG SFDDTVTVGI LHSLSGTMAI SESTLVDTEK 
MAIEEINAAG GVKVGGKSYK IEYIVEDGAS DWPTFAEKSK KLIDQDGVPV VFGGWTSASR
KAMLPVYESK DAFLYYPIQY EAQECSNNIF YTGATPNQQS EPATDFMYKR SPAAGGDFFL
VGSDYVFPRT SNTITKAQVK QLGGKVVGED YLPLGNTEVA PIISKIKKAL PEGGIIINTL
NGDQNVAFFK QIQDAGITPS SGYYVMNYSI AEEEISTIGP EFLEGHYGAW NYMMSIDTPA
SKKFAKSFKK RWGADRVVAD PQESAYNMVY LWKQAVEDAG TFDDNAVREA LVGQKFDAPQ
GPVEVMPNHH LSQTVRIGEI NAEGGFTILE ETGVVLPQAW NQKHPSSKGF ACDWTDPSKG
EKYKL