Gene Clim_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2037 
SymbolxseA 
ID6355541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2248219 
End bp2249616 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content55% 
IMG OID642669632 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_001944045 
Protein GI189347516 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGATG TTGCGGTTAT GCGCGAATGG CTCACTGGCA CTAATATAAA GGAATATATC 
CAATCCCGTA AATCACACCC CGACAGGATG CTCCGGCATC AGGCCAGGCA GAAGAACTCC
GGATTATGCA CGGTTTTTTA TTATTCTGTT GAAAGGAATC TTTTTTTCAA CTCTTCTCAT
CTTATTGCGA TGGGCGATGT CACGCTCTCT GTCAGTGAAC TGACCTTCCG GATTAAATCC
GAGCTTGAAA GCATCTTCCC CGTGGTGCGC GTCAAGGGCG AAATTTCGAA TTGCAAACGA
CACAGCTCGG GTCACACCTA CCTGACCCTG AAGGATGACC AGGCACAGAT ACCTGCGGTA
ATCTGGAAAA ACACCGGAAC CCGGATCAGC TTCGATCTCC GTGACGGCAT GGAGGTGATC
GCTGAAGGAC GACTGGAGGT GTACCCGCCT TCAGGGCGCT ATCAGCTTAT CTGCTCCTCG
GTAACCGAAG CCGGTCAGGG GCAGCTGCAG CAGGCGTTCG CCATGCTGCT CCAGAAACTC
GCAAAGGCCG GCTATTTCAA CGCGGAAAGA AAGAAAAAAA TACCGGCAAT ACCCGAAACC
ATCGGCATTA TCACCTCGCC GACCGGAGCC GTGATCGAGG ACATGGGCAG GGTGATCGAA
CGGCGTTTTC CTGCCGTTCG GATTCTGCTC TTTCCCGTCA GGGTACAGGG CGACGATGCG
GCGCGCGAGG TGAAAAGAGG CATCGACTAC TTCAACAATC CGGCCGATCC GCGACACCGC
GCGGATGTGC TGATCGTTGC CCGTGGCGGC GGATCCATGG AAGATCTCCA GGCATTCAAC
GAAGAGATGG TGGCCGAAGC CATCTACCGC TCATCGGTTC CGGTCATCAG TGCCGTCGGC
CATGAAACCG ATATCACCAT AGCTGACATG GTGGCCGATC TCCGTGCGGG AACTCCGTCG
ATTGCAGCGG AACTTGCCGT ACCCGACAGG GGAGATCTGC TGAAAACCAT TGAAAACCAG
CAGATGCGCC AGAGCGCCCT GATGCAGGCA AAGCTCGATG GCGCGAAAAT GGAGATCGAC
TCCCTTCGGC AGAGCTACGC ATTCAACCGG CCGCTGATGC AGCTGCAGCA GTTGTCGGAA
AAAGCCGAAA GCTTTCCGGA ACTGCTTGAC CTGGCCGTCA GGAGGAAATG GCTGCAGAAG
GCGACGGAGT TTGCCGCTGC CAACCAGCAG CTTGCCCTGC TCGATTACCG GAAAATTCTT
CAACGGGGCT ACGCTCTGGT AAAAAAAGAG ACCCGATTCA TAACCGGTTC ATGCGAACTC
GGGCTCTCCG ACCGTGCGGA CATTCTCTTT CATGACGGAA GTGTTGCCGT AACGGTCACC
GGCCCGCCGA CCTCCTGA
 
Protein sequence
MNDVAVMREW LTGTNIKEYI QSRKSHPDRM LRHQARQKNS GLCTVFYYSV ERNLFFNSSH 
LIAMGDVTLS VSELTFRIKS ELESIFPVVR VKGEISNCKR HSSGHTYLTL KDDQAQIPAV
IWKNTGTRIS FDLRDGMEVI AEGRLEVYPP SGRYQLICSS VTEAGQGQLQ QAFAMLLQKL
AKAGYFNAER KKKIPAIPET IGIITSPTGA VIEDMGRVIE RRFPAVRILL FPVRVQGDDA
AREVKRGIDY FNNPADPRHR ADVLIVARGG GSMEDLQAFN EEMVAEAIYR SSVPVISAVG
HETDITIADM VADLRAGTPS IAAELAVPDR GDLLKTIENQ QMRQSALMQA KLDGAKMEID
SLRQSYAFNR PLMQLQQLSE KAESFPELLD LAVRRKWLQK ATEFAAANQQ LALLDYRKIL
QRGYALVKKE TRFITGSCEL GLSDRADILF HDGSVAVTVT GPPTS