Gene A9601_08881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_08881 
Symbol 
ID4717594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp763633 
End bp764910 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content41% 
IMG OID640078600 
Productputative urea ABC transporter, substrate binding protein 
Protein accessionYP_001009279 
Protein GI123968421 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTT CAAGGCGTAT TTTGGCAGGT TTAGCTACTG CCTCACTTGC GGTCACCGCA 
ACTTCGTGTG GTGGAGGCGG AACCTCAGGA AGTTTCGATG ACACAGTCAC CGTTGGTATT
TTGCATTCGT TATCCGGAAC AATGGCTATC TCTGAATCAA CTCTTGTTGA TACAGAAAAA
ATGGCTATTG AAGAGATAAA TGCTGCTGGT GGTGTTAAAG TTGGCGGCAA AAGCTACAAA
ATAGAATACA TAGTCGAAGA TGGTGCATCT GACTGGCCAA CTTTTGCTGA AAAATCAAAA
AAACTTATAG ACCAAGACGG AGTTCCTGTC GTATTTGGTG GATGGACCTC TGCAAGTAGA
AAGGCAATGT TACCTGTCTA CGAATCAAAA GATGCGTTCC TCTATTACCC AATTCAATAT
GAAGCTCAGG AATGTTCTAA CAACATTTTC TATACAGGAG CCACACCAAA CCAACAATCG
GAACCTGCTA CAGATTTCAT GTATAAGCGT TCCCCCGCAG CTGGTGGAGA TTTTTTCCTT
GTAGGTTCTG ATTATGTTTT CCCAAGAACT TCAAACACAA TCACGAAAGC TCAGGTAAAA
CAGTTAGGCG GTAAAGTTGT TGGAGAAGAT TACCTTCCAT TAGGAAATAC TGAAGTTGCT
CCGATTATCT CAAAAATCAA AAAAGCGTTG CCTGACGGTG GAATAATCAT TAATACTCTT
AATGGTGACC AAAACGTTGC ATTCTTCAAA CAGATTCAAG ACGCCGGTAT CACACCTTCA
AGTGGATATT ACGTAATGAA CTATTCAATT GCTGAAGAAG AGATTAGCAC AATTGGTCCT
GAGTTCCTTG AAGGTCACTA CGGTGCTTGG AACTACATGA TGTCAATTGA TACTCCTGCA
TCTAAGAAAT TTGCTGCAAG TTTCAAGAAA AGATGGGGAG CAGATCGTGT TGTAGCTGAT
CCTCAAGAAT CTGCTTATAA CATGGTCTAC TTATGGAAAC AAGCAGTAGA AGATGCTGGA
ACATTCGACG ACAACGCAGT TAGGGAAGCC CTTGTTGGAC AAAAGTTTGA TGCTCCACAA
GGTCCTGTTG AAGTTATGCC TAATCACCAC TTATCTCAAA CAGTGAGAAT TGGTGAGATT
AATGCTGAGG GTGGATTTAC AATCCTTGAA GAGACAGGAG TTGTACTTCC TCAAGCATGG
AACCAAAAGC ATCCAAGTTC AAAAGGATTT GCATGCGATT GGACAGATCC TTCAAAAGGA
GAAAAGTATA AGCTTTAA
 
Protein sequence
MRISRRILAG LATASLAVTA TSCGGGGTSG SFDDTVTVGI LHSLSGTMAI SESTLVDTEK 
MAIEEINAAG GVKVGGKSYK IEYIVEDGAS DWPTFAEKSK KLIDQDGVPV VFGGWTSASR
KAMLPVYESK DAFLYYPIQY EAQECSNNIF YTGATPNQQS EPATDFMYKR SPAAGGDFFL
VGSDYVFPRT SNTITKAQVK QLGGKVVGED YLPLGNTEVA PIISKIKKAL PDGGIIINTL
NGDQNVAFFK QIQDAGITPS SGYYVMNYSI AEEEISTIGP EFLEGHYGAW NYMMSIDTPA
SKKFAASFKK RWGADRVVAD PQESAYNMVY LWKQAVEDAG TFDDNAVREA LVGQKFDAPQ
GPVEVMPNHH LSQTVRIGEI NAEGGFTILE ETGVVLPQAW NQKHPSSKGF ACDWTDPSKG
EKYKL