Gene Gdia_3032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3032 
Symbol 
ID6976466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3319456 
End bp3320859 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content72% 
IMG OID643392540 
Productcarbamoyl phosphate synthase small subunit 
Protein accessionYP_002277377 
Protein GI209545148 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.11486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0418192 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATAT CGATCGAGAC CATCATCGAG CGTCTGGGCG GCGCGGATCA TGCCGCCCGG 
CTGACCGGCG TCGGGACCGA GGCCATCCGC AAATGGCGGC AGGCCCGCGC GATCCCGCCC
AAGCATTGGA CGGTCATCCT GCGCCACACC GGCCTCAGCC TTTCCGACCT GCAACCCGAC
AGCGCGTCCG ACCGAGCGGA GACCCAGATG CCGGAGACCC CGTTACCCCC CGCCACCCAG
CCCCCCGAAG GCGCCACCGC CGCGCTGGTC CTGGCCGACG GCACGGTCGC GTGGGGGCGC
GGCTTCGGCG CGCATACGCC GGCCGCCGGC AGCGCCATCG GCGAACTGTG CTTTTCCACC
GGCATGACCG GCTATCAGGA AACCCTGACC GATCCCTCCT TCGCCGGGCA GATCATCACC
TTCACCTTTC CCCATATCGG CAATGTCGGC ACCAACGCGG ATGACGACGA AGCCCCCCGC
GTGGCCGCGC GCGGGCTGGC GGTCAAGCAG GACCTGACCG AGCCCGCCAA CTGGCGCGCG
ACGCAGGGGC TGGACGCCTG GCTGGCCGGC CAGGGCGTGC CGGGCATCTG CGGCGTCGAT
ACCCGCGCCA TCACGCTGCG GGTGCGCGAC GGCGGCCCGC AGACCGCCAT CCTGGCCTAC
CCCGCCGACG GCGTGTTCGA CCTGGACGCC CTGCGTGCCC AGGCCGCCGC ATGGCCGGGG
CTGGAAGGCA TGGACCTGGC CCGCGACGTG ACCTGCGCCG CCCCCTATTC CTGGGACAAG
GGCGTCTGGA CCTGGCCCGC GGGCACCTGC CCGCTGCCCG AGCGCCGCCG CCGCGTGGTC
GCGGTCGATT ACGGCGCCAA GCGCAACATC CTGCGCTGCC TGGCCAGCGC GGGCTGCGAC
GTGACGGTCG TGCCGGCCAC GGCCACGGCG GACCAGATCC TGGCCCACGC GCCGGACGGC
GTGTTCCTGT CCAACGGCCC GGGCGACCCG GCCGCGACCG CCGAATATGC CGTGCCGGCG
ATCCGCGGCG TGCTGGAGGC CGGCAAGCCG GTCTTCGGCA TCTGCCTGGG CCACCAGTTG
CTGGCGCAGG CGCTGGGCGC GCGCACCTAC AAGCTGGCGC GCGGCCATCG CGGCGCCAAC
CAGCCGGTCA AGGACCTGGG AACCGGGCGG GTCGAGATCA CGAGCCAGAA TCACGGCTTC
GCGGTGGACG AATCCAGCCT GCCCGCCGAC GTGCGCGTGA CCCATACCAG CCTGTTCGAC
GGCTCGAACG AGGGCATCGC CTCCGACCGC TATCCGGCCT TCTCGGTCCA GTACCATCCC
GAGGCCAGCC CCGGCCCGTC GGACAGCCAT TATCTGTTCG ACCGCTTCGT CGCCCTGATC
GACCGCGTCA ACGCACCCGT CTGA
 
Protein sequence
MPISIETIIE RLGGADHAAR LTGVGTEAIR KWRQARAIPP KHWTVILRHT GLSLSDLQPD 
SASDRAETQM PETPLPPATQ PPEGATAALV LADGTVAWGR GFGAHTPAAG SAIGELCFST
GMTGYQETLT DPSFAGQIIT FTFPHIGNVG TNADDDEAPR VAARGLAVKQ DLTEPANWRA
TQGLDAWLAG QGVPGICGVD TRAITLRVRD GGPQTAILAY PADGVFDLDA LRAQAAAWPG
LEGMDLARDV TCAAPYSWDK GVWTWPAGTC PLPERRRRVV AVDYGAKRNI LRCLASAGCD
VTVVPATATA DQILAHAPDG VFLSNGPGDP AATAEYAVPA IRGVLEAGKP VFGICLGHQL
LAQALGARTY KLARGHRGAN QPVKDLGTGR VEITSQNHGF AVDESSLPAD VRVTHTSLFD
GSNEGIASDR YPAFSVQYHP EASPGPSDSH YLFDRFVALI DRVNAPV