Gene Gdia_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2024 
Symbol 
ID6975451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2245607 
End bp2247109 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content70% 
IMG OID643391554 
ProductNa+/solute symporter 
Protein accessionYP_002276399 
Protein GI209544170 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.976623 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCGG CGCATCCCTC CGGCAGCGTG GCCCCCGGCG GCTTGGCCCC CGGCGGCCTG 
GCCCTGGTCG TCTTCGTCCT GTTCTTTGCC GCGACCATCC TGCTGGGCAG CAGCGTGTCG
TGGTGGCGGG GGCGCGGGAC CGCGCCGCAT GGTGGCGCGG CCGGGTCCGA GGAATGGGGG
CTGGGCGGCC GCCAGTTCGG CACCTGGATC ACCTGGTTCC TGGTCGGCGG CGATTTCTAT
ACCGCCTATA CCATCATCGC GGTGCCGGCG CTGGTCTACG CCACCGGGGC GTTCGGCTTC
TTCGCGCTGC CCTATACCAT CATCGTCTAT CCGTTCGTGT TCCTGGTCAT GCCCGTCCTG
TGGCGGATCG CCCATGACGG GCGGCACGCC ACGGCGGCGG ACATCGTGCG GGCGCGCTTC
GGCAGCCGGG CGCTGGAACT GGCCATCGCC GGCAGCGGGC TGGTGGCCGT CATGCCCTAT
ATCGCGCTGC AGCTCATCGG CATCCGCACC GTCATCGCGG CGCTGGGCCT GCCGGGCGAG
ATCCCGCTGA TCGTCGCCTT CGTCTCTCTG GCCGCCTATA CGTGGCTGGG GGGCCTGCAC
GCGCCCGCGC TGACGGCGTT CATCAAGGAC ATCATGATCT ATGTCGCGGT GCTGGCCGCC
GTCACGGTCA TCCCGCTGCA CCTGGGCGGC TATGGCGCGA TGTTCGCCAG TGCGGCGCGG
CACCTGCCGC ACCCGGCGGA CGGCCGCGCG CTGGCGGCGG GGCAGGGCGT GCCGTACGCC
ACGCTGGCAC TGTCATCCGC CCTGGCGGCG TTCCTGTATC CGCATACGCT GACCGGCGTG
CTGGCCGCCC GGTCGGCCGA CACCATCCGC AAGAACGCGG TGATGCTGCC GGCCTATACG
CTGCTGCTGG GGCTGATCGC CATGCTGGGA CTGATGGCCC ATGCGGCGGG CATCGTCACC
ACCCAGTCGT CATCGGTGGT GCCGCTGCTG TTCCTGACGA TCTTTCCCGA CTGGTTCAGC
GGTTTCTGCT TCGCGGCGGT CGCGGTCGGG GCGCTGGTCC CCGCCGCCGT CATGGCCATC
GGCGCCGCCA ACCTGGTCAC CAGCAACATC GCGCCCGCGT GGCGCCACGA CGTGCGCGCC
GCGCGCGTCA CGGCCCTGGG CGTCAAGCTG GGGGCGCTGG CCTGCGTGCT GTTCCTGAAC
GCGCAATTCG CCATCGACCT GCAATTGCTG GGCAGCCTGT GGATCCTGCA GACCTTTCCC
GCCCTGGTGC TGGGGCTGAC ACGCATTCGC TTTTCCGCCG CGTCCATGCT GCTGGGCTGG
GCGGCGGGCA CGGCATTCGG CACCATCGTC TGTTTCCATG ACGGGCTGAA ACCCACCCAT
CTGCTGGCGC TGGGCGGGAT GCATCTGGCG GTTTCGACCG GATTGCTGGC GCTGCTGGTC
AATATCGGCA CGGCGGCCAT CACGGCAGGG GCGCAGCACC TGCGCCCGGC AAAAGCCCCC
TGA
 
Protein sequence
MMAAHPSGSV APGGLAPGGL ALVVFVLFFA ATILLGSSVS WWRGRGTAPH GGAAGSEEWG 
LGGRQFGTWI TWFLVGGDFY TAYTIIAVPA LVYATGAFGF FALPYTIIVY PFVFLVMPVL
WRIAHDGRHA TAADIVRARF GSRALELAIA GSGLVAVMPY IALQLIGIRT VIAALGLPGE
IPLIVAFVSL AAYTWLGGLH APALTAFIKD IMIYVAVLAA VTVIPLHLGG YGAMFASAAR
HLPHPADGRA LAAGQGVPYA TLALSSALAA FLYPHTLTGV LAARSADTIR KNAVMLPAYT
LLLGLIAMLG LMAHAAGIVT TQSSSVVPLL FLTIFPDWFS GFCFAAVAVG ALVPAAVMAI
GAANLVTSNI APAWRHDVRA ARVTALGVKL GALACVLFLN AQFAIDLQLL GSLWILQTFP
ALVLGLTRIR FSAASMLLGW AAGTAFGTIV CFHDGLKPTH LLALGGMHLA VSTGLLALLV
NIGTAAITAG AQHLRPAKAP