Gene Gdia_0609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0609 
Symbol 
ID6974006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp684893 
End bp686299 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content67% 
IMG OID643390140 
Productcytochrome P450 
Protein accessionYP_002275016 
Protein GI209542787 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.484343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0324915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTCA TCCCTCCCTT TCCGCCCCGT CCGAAGACAT CGCTGTCGGT ATTCGAGCTG 
TTGCGGAGAG GAACGCAAAA CTTCCTGAAT ATATGGGAAG AAAAGGCATT CGAATACCAG
ACGATGTCGA TGCAGGTTCT GGCCCGGCAG GTCTTCATCT GCAACAGTCC GGACACCGTC
CGTCACGCCT TCATCACCCG TGCCGAGAAT TTTCAGCGCA AGAGCCCGCA GATACGCAAC
GCCCTGTCGC CGCTGCTGGG TGACGGGCTG TTCGTCAGCG ACGGCGAGAC CTGGAAGCAG
CGGCGACAGA TGGTCTCGCC GGTGCTGCAC ACCTCGCGCA TGGACCAGTT CGCCCCCGCG
ATGGTCGAGA CCGTCGGCGA ACTGGCCGAC AGATGGGCCG CCCTTCCCGA CGGGGCGACC
TTCGACGTGC TGAAGGTCAT GGCGCAACTG ACGGCGGAAA TCATCTGCCG CGCGGTATTC
GGCCGGACGC TGGGGGCGGA GCATGCGCGC GAGGTGGCGG AAGCCTTTAC CGAATACCAG
AAATATGTCG ATCAGAGCGA TCTCGCGTCG TTCGGCCTGC CATCCTGGGT TCCGCGCCGG
AACGGGGCCA AGACCCGCCG CGCGACGGCC CGGATTCACG CTGTCCTCGA CGGGATCATC
GCTGATCTTC AGCGCACCGA GGATGATGGC TCGGTCATCC GGATGCTGAT GCGCGATGGG
GTGCTGGACG CCACCGCGCT GCGGAACGAG GCCGCCGTCA TCTTCCTGGC CGGGCATGAA
ACCACGGCCA ACTGCCTGTC CTGGGTGTGG TACCTGCTGT CGCAGGCGCC GGAGGTCGAG
GCCCGCCTGC ACGAGGAACT CGATACCGTC CTCGGTTCGC GGGCACCGAC CTTCGCGGAT
GTGTCGCAAC TGGTCTATAC GCGTGCCATC GTCGAGGAGA CCCTGCGCCT CTACCCGCCC
GTGCCCCTGC TGGCGCGGGA GGCGAAGGAG GACGACACCA TCCGCAGCCG CAAGGTGAAG
GCCGGCGCGC TGGTCATGGT GGTGCCGTGG CTGCTGCACC GGCACCGTCT CTACTGGCGC
AAGCCCGACC ATTTCATGCC CGAGCGGTTC CTGCCCGGCA GTCCGGATGC GCCGCAGAAA
TATACCTACG TGCCGTTCAG CATCGGCCCG CGGATCTGTC CCGGCCTGTC CTTCGGGCTG
GTCGAGGCGA TCATCTGCCT GGCCTCGCTG GCACGCGGCA CGACGTTGCG CCTGGCGCCC
GGCGCGGTGG TCGAGCCGGT CTGCCGCCTG ACCCTGCGCC CCGGCGACAC CCTGCCGATG
ACGGTGTGGA AGCGCACGGC CGCCGCCGGG ACTCGACCGG TCCCGGCAGG CGCGTCGGCC
CAACGCTGTC CGGTCCACCA CGGCTGA
 
Protein sequence
MEFIPPFPPR PKTSLSVFEL LRRGTQNFLN IWEEKAFEYQ TMSMQVLARQ VFICNSPDTV 
RHAFITRAEN FQRKSPQIRN ALSPLLGDGL FVSDGETWKQ RRQMVSPVLH TSRMDQFAPA
MVETVGELAD RWAALPDGAT FDVLKVMAQL TAEIICRAVF GRTLGAEHAR EVAEAFTEYQ
KYVDQSDLAS FGLPSWVPRR NGAKTRRATA RIHAVLDGII ADLQRTEDDG SVIRMLMRDG
VLDATALRNE AAVIFLAGHE TTANCLSWVW YLLSQAPEVE ARLHEELDTV LGSRAPTFAD
VSQLVYTRAI VEETLRLYPP VPLLAREAKE DDTIRSRKVK AGALVMVVPW LLHRHRLYWR
KPDHFMPERF LPGSPDAPQK YTYVPFSIGP RICPGLSFGL VEAIICLASL ARGTTLRLAP
GAVVEPVCRL TLRPGDTLPM TVWKRTAAAG TRPVPAGASA QRCPVHHG