Gene Gdia_1402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1402 
Symbol 
ID6974810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1565119 
End bp1566339 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content67% 
IMG OID643390932 
Producthypothetical protein 
Protein accessionYP_002275797 
Protein GI209543568 
COG category[S] Function unknown 
COG ID[COG5441] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCGT CCGTCTATCT TCTGGGAACG GCGGATACCA AGTTCGCGGA ACTGGACTAT 
CTGCGATCGG TCCTGACCGG ACGGGGCGTC GGCAGCCATA TCGTGGATGT CGGCACCCGC
GAGGACCCGT GCGCCGCCGA CATCACCGCG CGACAGGTGG CGGCCTGCCA CCCCGATGGC
GCTGCCGCGG TCTTCTGCGG CGAGCGCGGG CGCGCCATCG CGGCGATGTC CGAGGCGCTG
CGGCGTTTCC TGCCGGGCCG CGCCGACCTG GCCGGCGTGA TCGCGATCGG CGGCTCCGGC
GGCACGGCCC TTGTCGCGCC GGCCCTGCAG GACCTGCCGA TCGGCCTGCC CAAGATCCTG
GTTTCCACCG TCGCATCGGG AAACGTGGCG CCCTATGTGG GCGAATCCGA TCTGTCCATG
GTCTATTCGG TCGTCGACCT TCAGGGGCTG AACCGTATTT CGCGCACCAT CCTGGCCAAC
GCCGCCAACG CCATGGCGGG CATGGTGCTG CACCCCGCGC CGCATGATGC CGGCACGCGT
CCGGCGGTGG GCATCACCAT GTTCGGCGTC ACCACACCCT GCGTTACGGA GGCGATGCAT
ATCCTGACAG GGGATTTCGA ATGTCTTGTC TTCCACGCCA CCGGAACCGG CGGGCGGTCG
ATGGAACGGC TTGTGCGCCA GGGCATGATC GGCGGCGTGC TCGACATCAC CACCACCGAG
TTCTGCGATT TCGTCGCGGG CGGCATCTTC CCCTGCGAGG CCGGGCGCCT GGATGCCGTC
GCCGCGACCG GTGTGCCCTA TGTCGGAAGC TGCGGCGGGC TGGACATGGT CAATTTCGGC
GCCCGGGATA CGGTGCCGGA CCGGTATCGC GACCGGGTTT TCGTGCAGCA TAATCCGTTC
ATCACGCTGA TGCGCACGAC GGCCGAGGAA TGCGGGCAGA TGGGACGCCT GATCGGCGCG
CGCCTCAACC GCTGCCACGG GCCCGTGCGC TTCTATTATC CGGAAAAGGG CTTTTCCCAG
CTCGATCGTC CTGGTCAGCC CTTTCACGAT CCGGCGGCGG ACGCGGCCTT CCGCGACGCG
CTGGCGTCCA CGCTGGAACA GACCGATCGG CGGCGTTTCA TCAGCCTGCC GCTTGCCCTG
AACGACCCGG CCTTCGCCCA GGCCATGGTC ACGGAATTCC GCACCCTTTT CGAGGAGAGT
CATCCCTATG CCCCGCATTG A
 
Protein sequence
MIPSVYLLGT ADTKFAELDY LRSVLTGRGV GSHIVDVGTR EDPCAADITA RQVAACHPDG 
AAAVFCGERG RAIAAMSEAL RRFLPGRADL AGVIAIGGSG GTALVAPALQ DLPIGLPKIL
VSTVASGNVA PYVGESDLSM VYSVVDLQGL NRISRTILAN AANAMAGMVL HPAPHDAGTR
PAVGITMFGV TTPCVTEAMH ILTGDFECLV FHATGTGGRS MERLVRQGMI GGVLDITTTE
FCDFVAGGIF PCEAGRLDAV AATGVPYVGS CGGLDMVNFG ARDTVPDRYR DRVFVQHNPF
ITLMRTTAEE CGQMGRLIGA RLNRCHGPVR FYYPEKGFSQ LDRPGQPFHD PAADAAFRDA
LASTLEQTDR RRFISLPLAL NDPAFAQAMV TEFRTLFEES HPYAPH