Gene Dshi_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0039 
SymbolribA 
ID5711661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp41277 
End bp42347 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content70% 
IMG OID641265933 
ProductGTP cyclohydrolase II 
Protein accessionYP_001531389 
Protein GI159042595 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.000270099 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCCTGA CCCCCGATCC GCGCGAATTG CGCGCCCGTG CCTGGGCGGA CCTGCGCATG 
GGCGTGCCCG TGGTGCTGCA TTCCGAGGGC CGTGCGGTGC TGGCGCTGGC GGGCGAGACG
CTCAAGCCTG CGCGATTGTC GGTGGTGGCG GGCATGGCGG AAGCGGTGCT CGCGATCACG
GCGCGGCGGG CCGAGACCCT GCGCGCGGTG CCCTATGACG GGGATATCGC GCGGATCGCC
CTGCCGGGCA ATGCGGATGC GCACTGGGTG CGGGCGGTGG CCGATCCTGC GGATGATCTG
CGGATGCCGA TGAAGGGGCC GTTCCGGGTG CTGCGCGACG GAGACGCAGT GTTGCACCGG
CTGGCGCTGA CCCTGTGCAA GGAAGCGCGG CTCTTGCCTG CGGCGGTGGT CGCGCCCGTG
GTGCCCGGGT TCGGCCCGGC GGAGGGTCTG ACGGTTCTGG ATGCCGCCGA CCTGCGCGTG
CCGATGGTGA TGGACGAGGT CGTCTCGGCC CGCGTGCCGC TGTCGGTGTC GGAGGCGGGG
CGGCTGCATG TGTTCCGGCC CGAGGATGGC AGCGAGGAGC ATTACGCGGT CGAGATCGGC
ACGCCGCCGC GCGACCAGCC GGTGCTGGCG CGGTTGCATT CGGCGTGTTT CACCGGTGAC
CTGCTGGGGT CGCTGAAATG CGATTGCGGG CCGCAATTGC GTGGGGCGCT GGCGCAGATG
GGGGCCGAAG GGGCGGGCGT ATTGCTGTAC CTGAACCAGG AGGGGCGGGG GATCGGGCTG
GCCAACAAGA TGCGCGCCTA TGCCTTGCAG GACCAGGGGT TCGACACGGT GGAGGCGAAC
CACCGGCTGG GCTTCGAGGA TGACGAGCGG GATTTCCGGA TCGGGGCGGA GCTTCTGCGG
CGGCTGGGGT TTTCGGCCAC GCGGCTCATG ACGAACAACC CGGCCAAGGT GGCGATGATG
GAGAATTGCG GGATCGCGGT GACCGAGCGC GTGCCGCTCA AGGTCGGGGA GACGCCGCAG
AACGCCGGGT ACCTCGCGAC CAAGGCGGCG AAGTCGGGGC ATTTGTTGTA G
 
Protein sequence
MSLTPDPREL RARAWADLRM GVPVVLHSEG RAVLALAGET LKPARLSVVA GMAEAVLAIT 
ARRAETLRAV PYDGDIARIA LPGNADAHWV RAVADPADDL RMPMKGPFRV LRDGDAVLHR
LALTLCKEAR LLPAAVVAPV VPGFGPAEGL TVLDAADLRV PMVMDEVVSA RVPLSVSEAG
RLHVFRPEDG SEEHYAVEIG TPPRDQPVLA RLHSACFTGD LLGSLKCDCG PQLRGALAQM
GAEGAGVLLY LNQEGRGIGL ANKMRAYALQ DQGFDTVEAN HRLGFEDDER DFRIGAELLR
RLGFSATRLM TNNPAKVAMM ENCGIAVTER VPLKVGETPQ NAGYLATKAA KSGHLL