Gene Saro_2400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2400 
Symbol 
ID3916719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2567075 
End bp2568604 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content65% 
IMG OID640445155 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_497670 
Protein GI87200413 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATCC GCGCCGCTGA AATCTCGAAG GTCATCAAGG ACCAGATCGC CAGCTTCGGC 
ACCGAGGCCC AGGTCTCGGA AGTCGGGTCG GTTCTCTCGG TCGGTGACGG CATCGCCCGC
ATCCACGGCC TCGACAAGGT CCAGGCCGGT GAAATGGTCG AATTCTCGAA CGGTGTGAAG
GGCATGGCCC TGAACCTCGA AGCCGACAAC GTCGGCGTCG TGATCTTCGG CTCGGACTCC
GAGATCAAGG AAGGCGACGT CGTCAAGCGC ACCGGCACCA TCGTCGACGT TCCCGTCGGC
AAGGGCCTGC TCGGCCGCGT GGTCGACGCG CTCGGCAACC CGATCGACGG CAAGGGCCCG
ATTGTCGACG CCACCCGCCA GCGCGTCGAA GTCAAGGCTC CCGGCATCAT CCCGCGCAAG
TCGGTGCACG AACCCGTGCA GACCGGCCTC AAGGCGATCG ACGCCCTCGT CCCCGTCGGC
CGTGGCCAGC GCGAGCTGAT CATCGGTGAC CGTCAGACCG GCAAAACCGC CGTCGCGATC
GACACCTTCA TCAACCAGAA GGCCGTCAAC GCCGGCACCG ACGAAGGCAA GAAGCTCTAC
TGCATCTACG TCGCCGTCGG CCAGAAGCGC TCGACCGTCG CGCAGATCGT CCGCCAGCTC
GAAGAGAACG GCGCGATGGA ATACTCCATC GTCGTGGCCG CGACCGCTTC GGAACCGGCT
CCGCTCCAGT ACCTCGCGCC CTACACCGGT GCGACCATGG GTGAATTCTT CCGCGACAAC
GGCATGCACG CCGTGATCGT GTACGACGAC CTTTCCAAGC AGGCCGTCGC CTACCGTCAG
ATGTCGCTGC TGCTCCGCCG TCCTCCGGGC CGCGAAGCCT ACCCCGGTGA CGTGTTCTAT
CTCCACAGCC GCCTGCTCGA GCGCGCCGCG AAGATGAACG ACGAAAACGG CGCTGGCTCG
CTCACCGCTC TGCCGATCAT CGAAACCCAG GCGGGCGACG TGTCGGCCTA CATCCCGACC
AACGTGATCT CGATCACCGA CGGCCAGATC TTCCTTGAAA CCGGCCTGTT CTATCAAGGC
ATCCGTCCGG CCATCAACGT CGGTCTGTCG GTGTCGCGCG TCGGCTCCTC GGCCCAGACC
AAGGCGATGA AGAAGGTTGC CGGCTCAATC AAGCTGGAAC TCGCGCAGTA CCGCGAAATG
GCGGCCTTCG CGCAGTTCGG TTCGGACCTC GACGCCTCGA CCCAGAAGCT CCTCAACCGC
GGTGCGCGCC TGACTGAACT GCTCAAGCAG CCCCAGTTCT CGCCGCTCGG CTTCGAAGAG
CAGACCTGCG TGATCTTCGC CGGCACCCAG GGCTACCTGG ATGCCGTTCC GGTGAACCGC
GTCACCGAAT ACGAAGCCGA ACTGCTCAGC TTCCTGCGCT CGCAGCATGC CGACCTGCTC
GGCCTGATCC GCGACACCAA GGACCTTGGC GACGAAGCCA AGGGCAAGCT GGTCGCGGCT
CTCGACGCTT TCGCCAAGCA GTTCGCATAA
 
Protein sequence
MEIRAAEISK VIKDQIASFG TEAQVSEVGS VLSVGDGIAR IHGLDKVQAG EMVEFSNGVK 
GMALNLEADN VGVVIFGSDS EIKEGDVVKR TGTIVDVPVG KGLLGRVVDA LGNPIDGKGP
IVDATRQRVE VKAPGIIPRK SVHEPVQTGL KAIDALVPVG RGQRELIIGD RQTGKTAVAI
DTFINQKAVN AGTDEGKKLY CIYVAVGQKR STVAQIVRQL EENGAMEYSI VVAATASEPA
PLQYLAPYTG ATMGEFFRDN GMHAVIVYDD LSKQAVAYRQ MSLLLRRPPG REAYPGDVFY
LHSRLLERAA KMNDENGAGS LTALPIIETQ AGDVSAYIPT NVISITDGQI FLETGLFYQG
IRPAINVGLS VSRVGSSAQT KAMKKVAGSI KLELAQYREM AAFAQFGSDL DASTQKLLNR
GARLTELLKQ PQFSPLGFEE QTCVIFAGTQ GYLDAVPVNR VTEYEAELLS FLRSQHADLL
GLIRDTKDLG DEAKGKLVAA LDAFAKQFA