Gene Sterm_2669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_2669 
Symbol 
ID8598125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp2834487 
End bp2837696 
Gene Length3210 bp 
Protein Length1069 aa 
Translation table11 
GC content39% 
IMG OID 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_003309445 
Protein GI269121268 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACG ATTCCATAAA AAAAGTACTT GTAATAGGGT CAGGACCTAT AGTTATAGGT 
CAGGCGGCAG AGTTTGACTA TTCAGGTACA CAGGCATGTG AAGCCTTGAG AGAAGAAGGC
ATAGAGGTAG TATTGATAAA TTCAAATCCG GCAACTATAA TGACAGATAG TAAAACGGCT
GATAAGATAT ATATAGAACC AATGAATATA CAGACTATAG AAAAAATAAT AGCACGAGAA
AAACCAGACT CACTTCTTCC CGGAATGGGC GGTCAGACGG CCCTGAATCT GGCTGTGGAG
CTAAAAGATG CGGGAATACT GGAAAAGTAT AATGTAAATA TAATAGGAAC TTCTATAGAA
TCTATAAAAA AGGGTGAGGA CAGAGAGCTT TTCAGAGAAA GTATGATGAA AATAGGAGAA
CCGGTAATAG ATTCCAGTAT AGTGACAAAC CTTGAGGACG GACAGGATTT TGCCGCAAAG
ATAGGATATC CTGTAGTAGT AAGACCGGCA TATACTCTTG GGGGAAGCGG AGGCGGTATA
GCCGGAAGTC CTGAAGAACT GAAGGATATA CTTTTGAAAG GATTGCAATT ATCAAGAGTG
GGACAGGTTC TTGTAGAGAA ATCAATACTT GGATGGAAAG AGATAGAATA CGAAGTAATA
CGGGATAAAG ACGGGAACTG TATAACTGTC TGCAACATGG AAAATATCGA TCCTGTAGGG
ATACATACAG GAGATTCGAT AGTAACAGCA CCGTCACAGA CACTTTCGGA CAAAGAATAT
CAGATGCTGC GTACATCATC GATAAAAATA ATAAATGAAA TAGGGATAGT GGGAGGATGT
AATGTTCAGT TTGCACTGAA TCCCCTGTCA TTTGAATATG CCATAATAGA AATAAACCCG
AGGGTTTCAA GATCATCAGC GCTTGCTTCC AAAGCAACAG GCTATCCCAT AGCAAGGGTA
GCGGCAAAGC TTTCGCTGGG ATATACGCTG GATGAGGTGA AAAATGAAGT GACTGGGGAG
ACATTTGCAT GCCATGAGCC TGCTATAGAC TATGTGGTAG TAAAAATACC AAAGTGGCCT
TTTGATAAAT TTCAGAAGGT AGACAGAAAG CTTGGAACGA AGATGATGGC CACTGGAGAA
ATAATGGCTA TAGGAAATAA TTTTGAAAGT GCCTTTCTGA AAGGGATAAG ATCACTGGAA
ATAAACCGTG ATAATCTGAT GCATCCGGCA TCGGCGAAAA GAAGTCTTGA AGAACTGAAA
GAAAGAATAC AAAAGCCAGA TGATGAGAGA ATATTTGATC TTGCAGAAAT GCTGAGAAGA
GGTTATGTAA AAAGACAGCT TGCCAGACTT ACAGGAATAG ATATTTTCTT TATAGAGAAG
ATAGAATGGA TAGTAAAACA GGAAGAACAG CTAAAGGATA TGACATTTAA CGATCTGAAT
ATGGAATATC TGAAAAAATT AAAGAGAAAA GGTTTTTCCG ATAAGGGAAT AGCGGAGCTT
ATGGGAATAA GCACAGAAGA TATAAGACTG AAAAGAAAAG AATACGGTAT AAAACCGGTA
TATAAAATGG TAGATACATG TGCAGCAGAA TTTGAGGCGG TTTCCCCATA TTATTATTCT
ACTTATGACA AATATGACGA AGTGGTGGTA AGTGACAGAA GGAAGATAAT AGTAATAGGG
TCAGGGCCTA TACGGATAGG ACAGGGTATA GAGTTTGACT ATTGTACTGT TCATAATATA
GCAGCACTGA AAAAAATGGG AATAGAATCA ATTATAATTA ATAATAATCC GGAAACTGTA
TCAACAGATT TTTCCACAGC TGATAAGCTT TATTTTGAGC CGCTTACAGC TGAGGATGTT
TTGGAAATAG TGGAAAAAGA AAAGCCGGAA GGAGTTATAC TTCAGTTTGG CGGTCAGACT
GCAATAAAGC TCGCAGGAGA ACTCACAAAA AAAGGAATAA AAATAATAGG GACATCTTTT
GAAAAAATAG ATGAGGCAGA GGACAGAGAA AAGTTTGATA AACTTACTGA CAGATTGAAA
ATAAAAAAAC CGGAGGGCAA GGCAGTATGG ACGGCAGAAG AGGGCGTGAA GACAGCAGAA
GCTATAGAAT ATCCCGTTCT GGTAAGACCG TCTTATGTAC TCGGCGGACA GGGAATGGAA
ATATGCTATG ACGAATGCAA CCTGAGAAAA TATCTGGAAT CTGCATTTTA CAGAGACAGT
GAAAATCCCG TTCTTATAGA TAAGTATCTA AACGGGGCGG AGATAGAAGT AGACGCTATA
TGTGACGGTG AGGACATATT GATTCCGAGC ATTATGGAAC ATCTTGAGAG AGCTGGTGTC
CACTCGGGAG ATTCCATAAC TGTGTGTCCT ACTCAGAACG TAAGTGAGGA TATAAAGAAA
AAAGTAGAGG AAATAACAAA AGTACTTGCC AGAGAGCTGG AAATTCTCGG GATGATAAAT
ATACAATTTA TTGCATACAA AGAGGAGCTT TATATAATCG AGGTAAATCC AAGATCATCA
AGAACTGTTC CATATGTATC TAAAATAACA GGTATTCCGG TAATTGAACT GGCAACAAGA
GCAGCTCTGG GAGAAAAGCT GAAGGATATG GGATATGGAA CTGGAGTGTA TAAGGAGCCT
AAACTGATAG CAGTAAAAGT ACCTGTATTT TCCACTGAAA AGATAGAAGG AATTGAAATA
TCACTGGGAC CGGAGATGAA GTCTACCGGT GAAGTACTGG GTGTAGGGAA GACATACGAG
GAAGCTGTCT ACAAAGGACT TCTTGCTGCA GATAAAAAAT ATCCGGAAAG CGGGAAAAAA
GCCCTTGTTA CTCTGAATGA CAATGATAAG GCTGAATTTC TGCCGCTTGC AAAAAAGCTT
GCCAAGCAAA ATTATAAGCT GTCTGCGACA GAAGGAACAT ATAAATTTTT AAAAGAACAC
GGCATAGAAT CCGAGGTAAT AAATAAAATC AGCGGAGACT CGCCTAACAT ACTTGATAAG
CTGAAAAACA GAGAAATAGA CATATTGATA AATACACCTA CCAAAGCAAA TGATTCACAA
AGAGACGGTT TTAAAATAAG AAGAACTGCT GTGGAATACG GGATAGAGGT ACTTACATCA
CTGGACACTC TAAATGCAGT ATTAGGGATA CTGGAAAAAG GACTGCATAA AAAAGAAACA
GAAATTTATG AAATGAACAG TCAAAAATAG
 
Protein sequence
MKNDSIKKVL VIGSGPIVIG QAAEFDYSGT QACEALREEG IEVVLINSNP ATIMTDSKTA 
DKIYIEPMNI QTIEKIIARE KPDSLLPGMG GQTALNLAVE LKDAGILEKY NVNIIGTSIE
SIKKGEDREL FRESMMKIGE PVIDSSIVTN LEDGQDFAAK IGYPVVVRPA YTLGGSGGGI
AGSPEELKDI LLKGLQLSRV GQVLVEKSIL GWKEIEYEVI RDKDGNCITV CNMENIDPVG
IHTGDSIVTA PSQTLSDKEY QMLRTSSIKI INEIGIVGGC NVQFALNPLS FEYAIIEINP
RVSRSSALAS KATGYPIARV AAKLSLGYTL DEVKNEVTGE TFACHEPAID YVVVKIPKWP
FDKFQKVDRK LGTKMMATGE IMAIGNNFES AFLKGIRSLE INRDNLMHPA SAKRSLEELK
ERIQKPDDER IFDLAEMLRR GYVKRQLARL TGIDIFFIEK IEWIVKQEEQ LKDMTFNDLN
MEYLKKLKRK GFSDKGIAEL MGISTEDIRL KRKEYGIKPV YKMVDTCAAE FEAVSPYYYS
TYDKYDEVVV SDRRKIIVIG SGPIRIGQGI EFDYCTVHNI AALKKMGIES IIINNNPETV
STDFSTADKL YFEPLTAEDV LEIVEKEKPE GVILQFGGQT AIKLAGELTK KGIKIIGTSF
EKIDEAEDRE KFDKLTDRLK IKKPEGKAVW TAEEGVKTAE AIEYPVLVRP SYVLGGQGME
ICYDECNLRK YLESAFYRDS ENPVLIDKYL NGAEIEVDAI CDGEDILIPS IMEHLERAGV
HSGDSITVCP TQNVSEDIKK KVEEITKVLA RELEILGMIN IQFIAYKEEL YIIEVNPRSS
RTVPYVSKIT GIPVIELATR AALGEKLKDM GYGTGVYKEP KLIAVKVPVF STEKIEGIEI
SLGPEMKSTG EVLGVGKTYE EAVYKGLLAA DKKYPESGKK ALVTLNDNDK AEFLPLAKKL
AKQNYKLSAT EGTYKFLKEH GIESEVINKI SGDSPNILDK LKNREIDILI NTPTKANDSQ
RDGFKIRRTA VEYGIEVLTS LDTLNAVLGI LEKGLHKKET EIYEMNSQK