Gene NATL1_00611 details


Gene Information

Locus tag: NATL1_00611
Symbol: dadA
ID: 4780671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 67001
End bp: 68104
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 40%
IMG OID: 640083324
Product: putative thiamine biosynthesis oxidoreductase
Protein accession: YP_001013890
Protein GI: 124024774
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID: [TIGR02352] glycine oxidase ThiO
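
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 67001
end_bp = 68104
gene_length_bp = 1104
protein_length_aa = 367

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which codes for no residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates agree with the gene length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus the stop agree with the protein length
```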


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.741936
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGTCT TAAATGAAAA ACCATTGTTA ATCCTCGGAG GAGGATTAAT GGGTCTTGCC 
ATAGCCCATG AACTTGCTAA AAGAGGCAAA CGAGTAGAAG TTTTAAGTAG AAGCAGACGT
GAAGCAGCAG GCTTTGTTGC TGCTGGAATG CTCGCCCCTC ACGCTGAAGG GCTTCAAGGT
AATCTTTTAA ATCTTGGTAA AAGTAGTCTT CAAAGGCACT CAACATGGAT AGAAAACATT
GAGACAAATA GCAAAATGTC ATGTGGTCTC AAAACTTGCG GGATTGTTGT CCCATTTGAA
AGCCACAAAG ACTGTGAATC CTATCCAACA TATAAATTTG GTAAAAAGCT AAACAGAATT
GAGCTCCTTC AAGAAGTTCC GAGACTCTCA GAAAAATGGA AACTAGGTTT ACTTTTTAAG
CAAGACGGCC AAATCGATAA TCGAAGACTT TTAATGAGAG CACTTGAAAA AGCTTGCTTT
GAATTAGGTG TTCACTTTCA AGAAGGAGTT GAAGTGGTTG AAATAATGAA AGGTCTAAAC
AAATTTAATG GGGTCAAAAT CAAAGACATT AATGGAAATA TCAATCATTT AAAAAGTGAA
GAGGCTGTTC TCTGCTGCGG AGCCTGGAGC AAACAAATTT TTAAAACATT GCCTATTTTT
CCTGTTAAAG GCCAGATGTT ATCTATTCAG GGTCCAAAAC AGATTCTTAA AAGAATTGTT
TTTGGACCTG GCATTTACTT AGTGCCAAGA GATGACGGTT TAATAATCGT AGGGGCAACT
AGTGAGCCTG AGGCAGGCTT CCAGACAGGA CTCACTCCAA ATGGGCAAAG CGAGCTTCAA
AAAGGAATTC AATCTCTTAT TCCTGAACTT AATCAACTAC CTCATATGGA GAGATGGTGG
GGTTTTCGTC CATGCACACC CGACGAAGGT CCCTTACTGG GAATGTCATC AATTAATGGA
CTCTGGCTTG CTACTGGGCA TCATCGCAAT GGAGTTCTAT TGGCAGCGAT AACTTCAGAA
TTAATTGGAA AATCAATTTG CTCAACTCCT TTAAGTAATG AGGAAGATAG TTTGTTGTCC
CATTTCAGAT GGGACAGATT TTAA
 
Protein sequence
MGVLNEKPLL ILGGGLMGLA IAHELAKRGK RVEVLSRSRR EAAGFVAAGM LAPHAEGLQG 
NLLNLGKSSL QRHSTWIENI ETNSKMSCGL KTCGIVVPFE SHKDCESYPT YKFGKKLNRI
ELLQEVPRLS EKWKLGLLFK QDGQIDNRRL LMRALEKACF ELGVHFQEGV EVVEIMKGLN
KFNGVKIKDI NGNINHLKSE EAVLCCGAWS KQIFKTLPIF PVKGQMLSIQ GPKQILKRIV
FGPGIYLVPR DDGLIIVGAT SEPEAGFQTG LTPNGQSELQ KGIQSLIPEL NQLPHMERWW
GFRPCTPDEG PLLGMSSING LWLATGHHRN GVLLAAITSE LIGKSICSTP LSNEEDSLLS
HFRWDRF
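
The relationship between the two sequence blocks can be illustrated by translating the gene sequence codon by codon. The sketch below is an assumption-laden illustration, not the database's own method: it builds the standard codon assignments (which translation table 11 shares for internal codons; table 11 differs in the start codons it permits, which does not matter here because this gene starts with ATG), strips the spacing used in the listing, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the blanks and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last rows of the gene sequence listed above. The rows before the
# last one total 1080 bases, so the last row is still in frame 1.
first_row = "ATGGGTGTCT TAAATGAAAA ACCATTGTTA ATCCTCGGAG GAGGATTAAT GGGTCTTGCC"
last_row = "CATTTCAGAT GGGACAGATT TTAA"

print(translate(first_row))  # MGVLNEKPLLILGGGLMGLA, the first 20 residues of the protein
print(translate(last_row))   # HFRWDRF, the last 7 residues of the protein
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the listed protein sequence and GC content to be compared against the nucleotide data.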