Gene TM1040_1945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1945 
SymbolxseA 
ID4076896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2048352 
End bp2049860 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content65% 
IMG OID638007261 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_613940 
Protein GI99081786 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0916178 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACC TCCTTGATGA TCCAACCCCG GGCCAGAACG CACCGGAGTT TTCCGTCTCC 
GAGATTTCCG GCGAGGTCAA ACGCACGCTT GAGGGCACCT TTGGCCGCAT CCGCGTGCGG
GGCGAGGTCG GGCGTGTGTT CAAGGCGCGC TCCGGTCATC TTTATTACGA CATCAAGGAT
GATCGCTCGG TGCTGGCCTG CACGACCTGG AAGGGCCAGA TTTCGGGACT GTCCGTGGTG
CCCGAAGAAG GGCTTGAGGT GGTGGTGACC GGGCGCCTCA CGGCCTTTGG CGGACAATCC
AAATACAATA TGAATGTCGA TGAGGTCGCG GTTGCAGGCC AGGGCGCGCT GATGGCGCTC
TTGGAAAAGC GCAAGGCGCA ACTGGCCGCT GAAGGGCTGT TTGCACCCGA GCGCAAGAAA
CCGCTGCCCT ATCTGCCGGG GATCATCGGC GTCATCACGT CGCCTTCAGG CGCTGTGATC
CGTGACATCC TGCATCGGCT TCGGGATCGC TTCCCGCGCA AGGTGCTGGT CTGGCCCGTG
GCCGTGCAGG GCAGCAACTC GGCCCCCGAG GTGGCGCGCG CCATCGATGG GTTCAACGCT
CTGACGCCCG GCGGCGCCTT GCCCCGGCCG GACCTGATCA TTGTCGCGCG CGGCGGCGGG
TCCATCGAGG ACCTCTGGGG TTTTAACGAG GAGATCGTCG CCCGCGCCAC CGCCGCAAGT
GACATCCCGC TGATTTCGGC GGTGGGCCAT GAGACGGATA CCACGCTGAT CGACTACGTT
TCGGATCTGC GTGCCCCCAC GCCCACGGCG GCGGCAGAAC ACGCGGTGCC CGTGCGGCTC
GAGTTGTTGG GCTGGGTCGA AAATCAGGGC GCGCGCATGG CCAATGCCGC CAGCCGCGCG
GTGCAGCTGC GCCGCCAGCG GCTCGGAGAT ATGGCGCGCG CTCTGCCGCG CCCGGATACG
CTCTTGGAAA CCCCGCGCCA GCGGCTCGAC AGAGTCTCTG ACCGGCTGCC CAATGCGCTG
ATTTCGGGCG TGCAACGGCG CAAACTCACG CTCAGCGACC GCGCCGCCTC CCTCAGACCC
GCCACCCTGC GCGGTCTTGT TTCCAGCCGT CAGGACAAGC TCAAAAACCT TTCTTCGCGT
CTCACCCTAC GCCCGATCAC TCAGGATCTG GGGCGCAAAC GAGACGCGCT GGACCGCATC
ACCAAGCGCC TTAACACTGC CCAAAGCAGC CGCATCGACC GCCAGATTGA TCGTCTGTCA
GCCACGGCGC GACAGCTTGA TATTCTGAGC TACAAGGCCA CGTTGCGTCG CGGGTATGCT
GTGGTGCGCG ATGGCGCGGC CCTGGTCACA TCCACCGAAG GCGCCCGGAA GGCCGCTGAA
CTCTCTATCG AATTTGCTGA CGGCACGTTT GATGTCGCCA GCGCCCCCAG CACCACAAAG
AAATCCGCGC CCAAACCGGC GGCCCCGAAG GCGCCCAAAA CGCCCGGAGA GCAGGGCTCT
CTATTCTGA
 
Protein sequence
MSDLLDDPTP GQNAPEFSVS EISGEVKRTL EGTFGRIRVR GEVGRVFKAR SGHLYYDIKD 
DRSVLACTTW KGQISGLSVV PEEGLEVVVT GRLTAFGGQS KYNMNVDEVA VAGQGALMAL
LEKRKAQLAA EGLFAPERKK PLPYLPGIIG VITSPSGAVI RDILHRLRDR FPRKVLVWPV
AVQGSNSAPE VARAIDGFNA LTPGGALPRP DLIIVARGGG SIEDLWGFNE EIVARATAAS
DIPLISAVGH ETDTTLIDYV SDLRAPTPTA AAEHAVPVRL ELLGWVENQG ARMANAASRA
VQLRRQRLGD MARALPRPDT LLETPRQRLD RVSDRLPNAL ISGVQRRKLT LSDRAASLRP
ATLRGLVSSR QDKLKNLSSR LTLRPITQDL GRKRDALDRI TKRLNTAQSS RIDRQIDRLS
ATARQLDILS YKATLRRGYA VVRDGAALVT STEGARKAAE LSIEFADGTF DVASAPSTTK
KSAPKPAAPK APKTPGEQGS LF