Gene Bind_3531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3531 
Symbol 
ID6200613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp4012003 
End bp4013268 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content60% 
IMG OID641707487 
Productcarbamoyl-phosphate synthase, small subunit 
Protein accessionYP_001834577 
Protein GI182680431 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.56559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATTTCT GGATCGAGCC CTTTAAACCC GATTTCAAGG CGCGCATGAC GCAGGATTTG 
CCAAGCTCTC CCCTCCCACT GCGGACAGAT GAAGCCAAAG GCTGGAGCAA GCCGGTCGCG
ACCGCCGTTT TGGTGCTCGC CGACGGCACC GTCCTGCGAG GCTCGGGCTT CGGCGCCATC
GGCGAGGCCG TCGCCGAAGT CTGCTTCAAC ACCGCGATGA CCGGTTATCA GGAAATCCTG
ACCGACCCGT CCTATGCTGA ACAGATCGTC ACCTTCACCT TTCCGCATAT CGGCAATGTC
GGGACCAACG AAGACGATTT CGAAACGACC AATTTCGAGG CGCAAGCGAG CGTGCGCGGG
CTGATCGTTC TCGCACCCAT CACCAATCCT TCGAATCATC GCTCGACGAG CCATTTCGAC
GCCTGGCTGA AATCACGCTC GATCATCGGC CTTTCCGGCA TCGATACACG CGCCCTAACG
ACGCTGATCC GCGAAAAAGG CATGCCCAAT GCCGTCATCG CCCATCATCC GGATGGGATT
TTCGACATCG AGGCCCTGAA AGCCAAGGCT GCGGCCTGGC ACGGCATAGA CGGAATGGAT
CTTGTTCCGC CCGTCACAAG CAGCAAGCCG CATGAATGGA CGGCAACCGG CATCCTTCCC
GCCCGTGCCT TGCAGCCCAA CAATGGCGAG AACAGGCATC GTGTTGTCGC CATTGATTAT
GGCGTCAAGC GCTCGATCCT GCAGCTCTTG ACCGAGGCGG GCTGCGCGGT CACCGTCGTC
CCGGCCACCG CATCAGCGCA AGAGATCGCC GCCCTGGAGC CGGACGGCAT TTTCCTGTCC
AATGGCCCTG GCGATCCCGC CGAAACCGCC AAATATGCGG TGCCGATCAT TCAGGATCTT
CTGGAGCGTA AAATCCCGAC CTTCGGCATT TGCCTCGGCC ATCAGATCCT GGCCCTGGCG
ATTGGCGCCA AAACGCACAA AATGCGGCAA GGCCATCACG GCGCCAATCA TCCGGTCCTC
GACAAGACCA CTGGAAAGGT CGAGATCGTG TCGATGAACC ATGGCTTCGC TGTCGATATC
GAAACCTTGC CGCCACAAGC AGTCGAGACG CATCTCTCTC TTTTCGACGG CACCAATTGC
GGCATTGCGC TCACCGACCG TCCTGCCTTT TCGGTGCAGC ACCATCCTGA GGCCTCACCC
GGCCCGCGCG ACAGTCATTA TCTCTTCCAG CGTTTCGTCA CGCTGATGGA ACAGGCGAAG
GCCTGA
 
Protein sequence
MDFWIEPFKP DFKARMTQDL PSSPLPLRTD EAKGWSKPVA TAVLVLADGT VLRGSGFGAI 
GEAVAEVCFN TAMTGYQEIL TDPSYAEQIV TFTFPHIGNV GTNEDDFETT NFEAQASVRG
LIVLAPITNP SNHRSTSHFD AWLKSRSIIG LSGIDTRALT TLIREKGMPN AVIAHHPDGI
FDIEALKAKA AAWHGIDGMD LVPPVTSSKP HEWTATGILP ARALQPNNGE NRHRVVAIDY
GVKRSILQLL TEAGCAVTVV PATASAQEIA ALEPDGIFLS NGPGDPAETA KYAVPIIQDL
LERKIPTFGI CLGHQILALA IGAKTHKMRQ GHHGANHPVL DKTTGKVEIV SMNHGFAVDI
ETLPPQAVET HLSLFDGTNC GIALTDRPAF SVQHHPEASP GPRDSHYLFQ RFVTLMEQAK
A