Gene P9303_30031 details

Gene Information

Locus tag: P9303_30031
Symbol: 
ID: 4778021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand: 
Start bp: 2657411
End bp: 2658529
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 55%
IMG OID: 640088527
Product: NAD binding site:D-amino acid oxidase
Protein accession: YP_001018998
Protein GI: 124024691
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID: 
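
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the function name `check_gene_record` is hypothetical and not part of any database tooling.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Check that a CDS record's coordinates and lengths agree.

    Assumes 1-based inclusive coordinates and one trailing stop codon
    that is not counted in the protein length.
    """
    span = end_bp - start_bp + 1              # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the Gene Information fields of this record.
result = check_gene_record(2657411, 2658529, 1119, 372)
print(result)
```

Under these assumptions all three checks pass for this record: the span is 1119 bp, which is 373 codons, giving 372 amino acids once the stop codon is excluded.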


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.549053
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTGCT CGTCAATATC AAGAATGACT GCTTCCACCG TTGCAATCAT TGGTGCCGGT 
GCGGTTGGCG CTGGCACAGC CTGGTATTTA GCCAAGCATG GCCACCAAGT GATGCTGATT
GATCCAAAAC TGGATCAACC GATTAACCGA TCAGGGGCCC TCCCGGGAAC AACTGCTTCG
CTAGGGGTAC TCATGGGGCA TGTTTTCAGG CGCAGCAGTG GACGAGCCTG GCGACTCCGA
CAACACAGCA TGACCCTCTG GCCAGAGTGG GTTGCAGAAC TGAGCAGTCA AGAGCATCCG
CTCAAGCTCA ACACGCCTCT GATTCAACTT GCGAGTAGCG AAGCAGAAGC CACCCTGATG
AAGCAACTCA CAGAACAACG GCAACATCTG GGCCTTGAGC TGATCTCACC AAACTCGAAT
CCCTGCATGG GCCGATCATG GCCAAACACC CAACATGGGG GCCTGATCTC TCATCAAGAC
GGTTATCTAG ACCCGATCGC CCTACAACAA TGCCTACGGG CCGCCCTACA AGACCAAGGC
GTACAACAAA TCCAAGAGCC AGTTGTCTCG CTGGAACGAA ATTCATCTGT CGAAGAAAAA
CAGTGGCGCC TTCAACTTGC AGGAGGAACG AATTTGAACC AAGACGCTGT CGTGATCTGT
GCAGCACTTG GCAGCGAAGC CCTGCTGGAA CAACTAGGCC ACAGTCTTCC CATGGCCCCT
GTGCTTGGAC AAGTGCTGGA TCTAGAGGTG ATCTCAGATC AGCACAATTG GAGCGGCTGG
CCTGCAGTAC TCGTGAGCCA TGGCATCAAC CTGATCCCCC ACGGACCCAA TCAGATCTGG
ATAGGTGCCA CTCTCGAGCC AGGAGTGCAA CCAATAGCGA GCCACCTAAA GGCCATGCAA
CACCTCGAGG GAGATGCCCC GGATTGGTTA GAAAGCGCGA CTGTGAAAGA CCAATGGCAT
GGATTGCGCG CTCGACCTGT CGAACGTCCA GCACCTCTTT TAGAAAAACT AGAGCCCGGG
CTAATCGTGG CTACAGGCCA TTACCGAAAT GGCGTCTTGC TCGCCCCGGC CAGCGCTGCA
TGGGTCAAAG AGCAACTCAC TAACGAGACA AGATCTTGA
 
Protein sequence
MHCSSISRMT ASTVAIIGAG AVGAGTAWYL AKHGHQVMLI DPKLDQPINR SGALPGTTAS 
LGVLMGHVFR RSSGRAWRLR QHSMTLWPEW VAELSSQEHP LKLNTPLIQL ASSEAEATLM
KQLTEQRQHL GLELISPNSN PCMGRSWPNT QHGGLISHQD GYLDPIALQQ CLRAALQDQG
VQQIQEPVVS LERNSSVEEK QWRLQLAGGT NLNQDAVVIC AALGSEALLE QLGHSLPMAP
VLGQVLDLEV ISDQHNWSGW PAVLVSHGIN LIPHGPNQIW IGATLEPGVQ PIASHLKAMQ
HLEGDAPDWL ESATVKDQWH GLRARPVERP APLLEKLEPG LIVATGHYRN GVLLAPASAA
WVKEQLTNET RS