Gene Caul_1226 details


Gene Information

Locus tag: Caul_1226
Symbol: xseA
ID: 5898681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1286273
End bp: 1287859
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 72%
IMG OID: 641561711
Product: exodeoxyribonuclease VII large subunit
Protein accession: YP_001682854
Protein GI: 167645191
COG category: [L] Replication, recombination and repair
COG ID: [COG1570] Exonuclease VII, large subunit
TIGRFAM ID: [TIGR00237] exodeoxyribonuclease VII, large subunit
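
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; both are assumptions, not statements made by the record. The helper names `span_length` and `coding_codons` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1286273
end_bp = 1287859
gene_length_bp = 1587
protein_length_aa = 528


def span_length(start, end):
    """Length of a closed interval [start, end] (assumes 1-based, inclusive)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks compare derived values against the fields listed in the record.
coords_match = span_length(start_bp, end_bp) == gene_length_bp
protein_match = coding_codons(gene_length_bp) == protein_length_aa
print(coords_match, protein_match)
```

Under these assumptions, 1287859 - 1286273 + 1 = 1587 bp, and 1587 / 3 - 1 = 528 codons, which agrees with the listed gene and protein lengths.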


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.901078
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0774047
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACG TCGTCCCCGA TTCCAACGCG CCGGCCTATT CCGTCTCGGA ACTGGCCTTC 
GCCCTGAAGC GCACGCTGGA AACCAGCTAC GCCTTCGTGC GGCTGCGAGG GGAGCTGAGC
AAGGTCACCC ACCACGGCAA CGGCCACGTC TACCTGACCA TCAAGGACGA CAAGTCGGCC
ATCGACGGCG TGGTCTGGAA GGGCAATGTG CGCGGCCTGG GCATCCGGCC CGAGCACGGG
CTGGAAGTGA TCGTCACCGG CAAGATCACC ACCTATCCGG CCGGCTCGCG CTACCAGATC
GTCATCGAGA CCATGGAGGC CGCCGGGGTC GGCGCGCTTC TCGCGCAACT GGAGCGCCTG
AAGGCCAAGC TGAACGCCGA GGGCCTGTTC GCCCCGGACC GCAAACGCCC CCTGCCCTCG
ATGCCCGCCG TGGTCGGGGT GATCACCAGC CCGACCGGGG CGGTGATCCG CGACATTCTC
CATCGCATCC GCGACCGCTG GCCCTGCCAG GTGCTGGTGT GGCCTTGCGT GGTGCAGGGC
GACGCCGCCG CCGGCCAGGT CAGCGCGGCG ATCCGGGGCT TCAACGCCCT CACGCCGGGC
GGACCCGTCC CGCGTCCCGA CATTCTCATC GTCGCCCGGG GCGGCGGCTC GGTCGAGGAC
CTGTGGGCCT TCAACGACGA GGGCCTGGCC CGCACCGTGG CCGAGGGGAC GATCCCGCTG
ATCTCGGCCG TCGGCCATGA GACCGACACC ACCCTGATCG ACTTCGTCTC CGACCGTCGC
GCGCCGACGC CCACCGCCGC CGCCGAGATG GCGACCCCGG TGCTGGCCGA ACTGCGGGCC
TTGGTCAGCG ACTACGACCG GCGGCTGAAC AACTGCGGCG GCCGGGCGGT CGAGGAGCGG
CGGACGCGGC TGACCAGCGC CGCGCGGGGC CTGCCCCGTC CCGCCGACCT GCTGGCCCTG
GCTCAGCAGC GGTTGGACCT GGCCGTGCGC GGCCTGCCCC GGCTCGACGA CCTGACCGCC
CCCGCCGAGC GTCGCTTCAA GGAGGCGGCC GCCCGCCTGG ATACGGCGCT CCAGCGCAAC
ACCGACATCC ACGCCCGCGA CCTGCTGAAG GTCACCGCCC GCCTGTCGCC CGACACCCTG
CACCGCCAGC GCGCCGATGC TGGCCGCCGG GTCGGCGACC TCGGCCGCCG CCTGGACCTG
GCGGCCCGGC GCGTCCCCGA GCGCGTCGCC CAGCACGCCC GCCTGCCGGC TCTGCAGGAT
CGCCTGAACG CCGTCGCCCG GCGCCGGCTG GAGCGCGAGA CCGATCGTCT GGCCGGCCTG
GAGAAGCTGC GCCAGTCGCT CAATCCGACG AGGCCTTTGG AACTGGGTTT TGCGCTCGTG
CACAAGGCAG ACGGAGGGAT CGCACGTTCC GCCTCCGAGC TCGCCAGCGG CGAGCGCGTC
AGGCTGCAAT TCAGGCAGGG CGACCGCGAC GCGGTGATCG ACGGGGAAAG TTCAGGGGTG
CTTCCTCCGT CCGCCGCACC GGCGCCAACG CGCCCGACGC CCAGGCCCAA GCCGGCGTCC
TCCTCGGATC AAGGCAGCCT CTTTTGA
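
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base, space-separated blocks shown above; the function name `gc_percent` is hypothetical, and only the first row of the sequence is used here for illustration, so the value it yields is not the whole-gene figure.

```python
def gc_percent(seq):
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above; paste all rows to check the full gene.
first_row = "ATGACCGACG TCGTCCCCGA TTCCAACGCG CCGGCCTATT CCGTCTCGGA ACTGGCCTTC"
first_row_gc = gc_percent(first_row)
print(f"GC content of first row: {first_row_gc:.1f}%")
```

Joining all sequence rows into one string and passing it to `gc_percent` would give a value to compare against the GC content listed in the Gene Information section.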
 
Protein sequence
MTDVVPDSNA PAYSVSELAF ALKRTLETSY AFVRLRGELS KVTHHGNGHV YLTIKDDKSA 
IDGVVWKGNV RGLGIRPEHG LEVIVTGKIT TYPAGSRYQI VIETMEAAGV GALLAQLERL
KAKLNAEGLF APDRKRPLPS MPAVVGVITS PTGAVIRDIL HRIRDRWPCQ VLVWPCVVQG
DAAAGQVSAA IRGFNALTPG GPVPRPDILI VARGGGSVED LWAFNDEGLA RTVAEGTIPL
ISAVGHETDT TLIDFVSDRR APTPTAAAEM ATPVLAELRA LVSDYDRRLN NCGGRAVEER
RTRLTSAARG LPRPADLLAL AQQRLDLAVR GLPRLDDLTA PAERRFKEAA ARLDTALQRN
TDIHARDLLK VTARLSPDTL HRQRADAGRR VGDLGRRLDL AARRVPERVA QHARLPALQD
RLNAVARRRL ERETDRLAGL EKLRQSLNPT RPLELGFALV HKADGGIARS ASELASGERV
RLQFRQGDRD AVIDGESSGV LPPSAAPAPT RPTPRPKPAS SSDQGSLF