Gene TM1040_2294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2294 
Symbol 
ID4078478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2412716 
End bp2413876 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content65% 
IMG OID638007616 
Productcarbamoyl phosphate synthase small subunit 
Protein accessionYP_614288 
Protein GI99082134 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGAAA CCGTTTCGCC GAAGCCAACC GCCTGCCTGG CCCTTGCAGA TGGCACCGTG 
TTCTATGGAC ACGGGTTTGG CGCCACCGGG CGATGCGTGG CCGAGCTGTG CTTTAACACC
GCAATGACCG GCTATCAGGA AATCATGACC GACCCTTCCT ATGCAGGTCA GGTCGTCACC
TTCACCTTCC CCCACATCGG CAACACCGGC GTCACCCCCG AGGATGACGA AACCGCCGAC
CCGGTGGCCG CAGGCATGGT GGTGAAATGG GATCCGACGC TGTCCTCCAA CTGGCGCGCC
ACCGAAGAGC TGAAGTCCTG GCTCACCCGC ACGGGCCGCA TCGCCATCGG CGGCGTGGAC
ACCCGCCGTC TGACTCGCGC GATCCGCCAG CAGGGCGCGC CGCATGTGGC GATGGAGCAT
AACCCGGACG GGAATTTCGA TCTTGAGGCG CTGGTCGCCG CCGCCCGCGC CTGGCCCGGC
CTTGAGGGCA TGGACCTCGC CAAGGACGTG ACCTGCGCGC AGTCCTACCG CTGGGATGAG
ATGCGTTGGG CCTGGCCCGA GGGCTACACC CGTCAGGAAG AGCCCAAGCA CAAGGTGGTC
GCCATCGACT ATGGTGCCAA GCGCAACATC CTGCGCTGCC TCGCCTCGGC GGGCTGCGAT
GTCACCGTGC TGCCGGCCAC CGCAACCTCG GAAGAGGTAC TGGCCCATGG CCCTGATGGT
GTGTTCCTCT CCAATGGCCC CGGCGACCCG GCCGCAACCG GCGCATACGC TGTGCCGATG
ATCAAGGAAA TCCTGGATAA GACCGACTTG CCGGTCTTTG GGATCTGTCT GGGCCACCAG
ATGCTCGCAC TCGCTTTGGG GGCCAAGACC ACCAAGATGA ACCACGGCCA CCACGGCGCC
AACCACCCGG TCAAGGAACA CGGCACCGGC AAGGTGGAGA TCACGTCGAT GAACCACGGC
TTTGCAGTGG ATGCTCAAAC CCTGCCCGAG GGCGTCGAAG AGACCCATGT CTCGCTGTTT
GACGGCTCCA ACTGCGGCAT TCGCATGACC GATCGCCCGG TCTACTCCGT GCAGCACCAC
CCCGAGGCCA GCCCCGGCCC GCAGGACAGT TTCTATCTGT TCGAGCGCTT TGCAGAGGCG
ATGGCCGCGC GCAAGGCCTG A
 
Protein sequence
MVETVSPKPT ACLALADGTV FYGHGFGATG RCVAELCFNT AMTGYQEIMT DPSYAGQVVT 
FTFPHIGNTG VTPEDDETAD PVAAGMVVKW DPTLSSNWRA TEELKSWLTR TGRIAIGGVD
TRRLTRAIRQ QGAPHVAMEH NPDGNFDLEA LVAAARAWPG LEGMDLAKDV TCAQSYRWDE
MRWAWPEGYT RQEEPKHKVV AIDYGAKRNI LRCLASAGCD VTVLPATATS EEVLAHGPDG
VFLSNGPGDP AATGAYAVPM IKEILDKTDL PVFGICLGHQ MLALALGAKT TKMNHGHHGA
NHPVKEHGTG KVEITSMNHG FAVDAQTLPE GVEETHVSLF DGSNCGIRMT DRPVYSVQHH
PEASPGPQDS FYLFERFAEA MAARKA