Gene Gdia_1983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1983 
Symbol 
ID6975409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2200429 
End bp2201655 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content66% 
IMG OID643391512 
Productprotein of unknown function UPF0118 
Protein accessionYP_002276358 
Protein GI209544129 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCTG TACCCGATCT GGACGATGCC GACATGTTCG AGGCACATCA GCAGCAGCGC 
GCGGCCGAGG TCGTGCGTCG CAGCCGCTTC GATCCCCAGA CCATCTGCCT GCTGATCCTC
ACGGTCCTGG CGGTCTTCTA CACGCTGTAT TTCGCGGCGG CGATCATCCT GCCGATCGTG
CTGGCGCTGG TGGTCAACCT GCTGCTGTCG GCGCCGATGC GGGTGCTGCA TACGCGGCTG
CACCTGCCCA AGACGCTGTC GGCGCTGGTG CTGATCCTGG GCGTGTTCGG GGTGGTGGGC
GCGATCGGCA CCGCGATCTC GGTACCCGCC GCCGGCTGGA TCGCGCGCGC GCCGCAGACC
ATGGCCGCCC TCCAGACGCA CCTGGCCGTC CTGCACCGCC CGATCCAGAT GATCCAGGCG
GCCAATGACC GGATCGAGAA TTTCCTGTCC GTCGTCAGCG GACGGCAAGG GGGCGGTGGC
GGCGGTCAGG TGGTGCTGCT GGCGCCCTCG TCGTCGCCGG GCGGCGGGCT GGGCACGTTC
GGCTCCAGCG TGCTGCTGGG CACGCGTGCC TTCGTGGGCC AGCTCTTCAC CATGATGCTG
ATGCTGTTCT TCCTGCTGGC GCAGGGCGAC AGCCTGCTGC GCCGGTTCGT CGAGATCATG
CCGACCTTCG CCGACAAGCG CCGCGCGGTG CAGATCGCCT ATCAGATCGA ACGCAATGTC
TCGCTCTATC TGACCACCAT CACGATCATC AACGTGCTGG TCGGCCTGGC GAACATGCTG
CAATGCTGGG TGTTCGGCAT GCCGAACCCG CTGCTGTGGG GGGTCCTGGC CTTCCTGCTG
AACTATATTC CCATCATCGG GCCGCTGACC GGCATCGTGA TCTATTTCGT CGTCAGCCTG
TTCGTCTTTC CGTCGGCCCT GCAGGCGCTG CTGCCGCCCA CGGTGTATCT GTGCATCCAC
CTGATGGAAG GCGAGACGAT CACGCCGATG GTGCTGGCCC GGCGCTTCAC CCTCAATCCG
GTGCTGGTCA TGGGCTCGCT GATGTTCTGG GACTGGCTGT GGGGCGTGTG GGGGGCGTTC
CTGTCGGTGC CGATGCTGGC GGTGTTCAAG ATCATCTGCG ACCATGTCGA TGTCCTGACC
CCGATCGGCC ACGTTGTCGG CGGCCCGACC CGCGCACGCA CCGTACAGTC CGCGATCATT
CCCCGGCGGG AACAGGAAAC CGAATAG
 
Protein sequence
MPPVPDLDDA DMFEAHQQQR AAEVVRRSRF DPQTICLLIL TVLAVFYTLY FAAAIILPIV 
LALVVNLLLS APMRVLHTRL HLPKTLSALV LILGVFGVVG AIGTAISVPA AGWIARAPQT
MAALQTHLAV LHRPIQMIQA ANDRIENFLS VVSGRQGGGG GGQVVLLAPS SSPGGGLGTF
GSSVLLGTRA FVGQLFTMML MLFFLLAQGD SLLRRFVEIM PTFADKRRAV QIAYQIERNV
SLYLTTITII NVLVGLANML QCWVFGMPNP LLWGVLAFLL NYIPIIGPLT GIVIYFVVSL
FVFPSALQAL LPPTVYLCIH LMEGETITPM VLARRFTLNP VLVMGSLMFW DWLWGVWGAF
LSVPMLAVFK IICDHVDVLT PIGHVVGGPT RARTVQSAII PRREQETE