Gene Cag_0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0042 
Symbol 
ID3747241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp47090 
End bp48994 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content36% 
IMG OID637772568 
Producthypothetical protein 
Protein accessionYP_378364 
Protein GI78188026 
COG category[L] Replication, recombination and repair 
COG ID[COG3593] Predicted ATP-dependent endonuclease of the OLD family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.581488 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTACAAA AACTCATTAT TAAACGATTT CGAGGCTTTA GCACTCTTGA GGTAGATATC 
CCCAAAGTGT TGCTTCTAAT GGGACCGAAT AGCTCTGGGA AAACTACAGC CCTTCATGCT
ATACGTATTT CATGCCAAGC GGCTTGGATT GCTGTAACAA ATAACATAGC GTGGAAAGTT
GAAGATACTG TTATTATTTT TAAAGATTTT ATTATTCGAG ATATATCGCA ACTGATGCCA
ATTGCTGATT GGCAAGCACT TTTTGTTAAT CAAATTGTTG GTGAACATAC ACATTTTAGC
ATAGAAATTA TCTTTGAAAA AACGGATGCT CTATCATCTA TCTTAATAGA AGGGAAATAT
GCGCGAAATG AAAACCTTAA AATAACAGCA ACTATTGGCG CAGAAACACT CATTAATAAC
CTCAAAAACA TATCAAATCG TTCATCACAA TATAAAAACA TAGCTTTTGA ATTTTTTCAA
AAACATCTAC CTAAAGCAAT TCTTATCCCT CCCTTTTATG GGGTAATTAG AGATGAAGAG
TATCGCGCAA AAGCAGTTGT TGATGCTATG GTTGGCTCTG CTGATCAAAG TCATGTTGTA
CGCAACATGA TTTCACGCTT ATCAACAACG CAATTAGAGC AACTGAATGC TTTTATAAAA
GATATGGTTG GAGCTACATT AGTACAACGT ACTCAAGGCG ATGATATAGA GAAAATATCT
CCGTTACGAG TAACATTTCG CGATACAAAT GGTGAACTTG AACTTTCTGC GGCAGGGGCT
GGTCTTATTA ATCTTATAGC TCTTTATTCA TCACTTGCTC GATGGGAATC AGAAACTATA
GACCGACAAA TTATTTTTCT TCTTGATGAA CCCGAAGCAC ATTTACATCC TCGCTTACAG
GGTTATACTG CTGATAGATT AGCAACCATT ATAACGAATG ATTTTAATGC TCAACTTATT
ATGGCAACCC ACTCTATTGA GATTATCAAT AAAATTGGGG AGCGAGATGA TGCTACAATT
TTTAGAACAG ATCGTTTAAA CAAAGAAAAA GGTGGGCAAC AACTCATAGG ACAAACCCCA
CTACTTGATG ATCTTTCGCA ATGGGCAGAT TTAACGCCAT TCTCTATTAT TAACTTTTTA
GCATCAAAAA GAATCCTTTT TTATGAGGGA AAATCGGATG GCATAATTTT AACAAAATGC
GCAGAAATTC TTTTCCGAAA TAATCCTGAT AAAAAGAAAA AATTTGAAAA ATGGACATTA
ATTCAACTCG AAGGATCGGG CAATAAAAAT ATTGCTCAAC TGTTGGCTCA TCTTATTGAT
TCCAGCACGT TTGCAAGTGT AGCAGAAAAA AAAGATTTCA AAATTGTTGT ACAACTTGAT
AAAGATTATA ACGATGAAGT AGAGCAACTA AAGTTAATCA CGAATCGCGA TATTTCAACA
TTTTACAATA TATGGTCGAA GCATAGTATT GAGTCCTTGT TTTGTGAAAG CGCCACACTC
TATCAATGGT TAAAACCAAA ATATCCTGAC ATTCAGGAAG AGACAATAGA GAAAGCAATT
ATAGCCGCCA ATCAAGATAA TGAGCTGAAC CAATATGCCC GTGAGCAACG TCAAGCAACT
TTATTAAAGC CGCTACAAAA AATATCAGAA AATATAACGG CAACAAATAG GCAAGCCGAT
AATGATATAG CCGCTACTCC CGAAATATGG CAGCGAGGAA AAGATCGATC CAAGGTCATA
TTACACCATA TAAAAACAGC TCTTAGCACA TCGGCAAATT CTCTCAGCAC ATCACTTACT
AAAGTAATTG AAAAAGCCGA TGTGAATCTC TTTCCAGCGG GAAATAGGGC AGTAGTTCCC
TCAGAAATTA AGCAGCTTCT GGATTGGATG GTTACAAACG CATAG
 
Protein sequence
MLQKLIIKRF RGFSTLEVDI PKVLLLMGPN SSGKTTALHA IRISCQAAWI AVTNNIAWKV 
EDTVIIFKDF IIRDISQLMP IADWQALFVN QIVGEHTHFS IEIIFEKTDA LSSILIEGKY
ARNENLKITA TIGAETLINN LKNISNRSSQ YKNIAFEFFQ KHLPKAILIP PFYGVIRDEE
YRAKAVVDAM VGSADQSHVV RNMISRLSTT QLEQLNAFIK DMVGATLVQR TQGDDIEKIS
PLRVTFRDTN GELELSAAGA GLINLIALYS SLARWESETI DRQIIFLLDE PEAHLHPRLQ
GYTADRLATI ITNDFNAQLI MATHSIEIIN KIGERDDATI FRTDRLNKEK GGQQLIGQTP
LLDDLSQWAD LTPFSIINFL ASKRILFYEG KSDGIILTKC AEILFRNNPD KKKKFEKWTL
IQLEGSGNKN IAQLLAHLID SSTFASVAEK KDFKIVVQLD KDYNDEVEQL KLITNRDIST
FYNIWSKHSI ESLFCESATL YQWLKPKYPD IQEETIEKAI IAANQDNELN QYAREQRQAT
LLKPLQKISE NITATNRQAD NDIAATPEIW QRGKDRSKVI LHHIKTALST SANSLSTSLT
KVIEKADVNL FPAGNRAVVP SEIKQLLDWM VTNA