Gene Cagg_0507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0507 
Symbol 
ID7267004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp623683 
End bp624771 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content59% 
IMG OID643565370 
ProductDNA protecting protein DprA 
Protein accessionYP_002461882 
Protein GI219847449 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.402405 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00180808 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCGACG ATACTATTCG CTACTATATC GGGTTTAACT TGACACCGGG AATTGGTCCA 
CTGCGACTGA GCCGTCTGAT CGAGCGATGT GGTTCGGTGG CTGCGGCCTG GCATGCCGAC
GACGCCACCA TGGTAGCTGC CGGTCTCGAT GCACGGAGTA TTGCCAACCT ACGCACTGCC
CGCCAAACCC TTGATCTCGA CGTCGAAGTG GCACGTCTGC GGGCGCACGG TATCACCCCA
CTGGCAATTA CCGATCCGGC CTATCCACCC TTACTACGAA TGATCGCTGC CCCACCCCCA
TTGCTGTATG TGCGTGGGAC AATGACTGCC TCGGATCAAC GAGCCATCGC CATTGTTGGT
ACACGCCATC CTACACCCTA CGGACGGGAA GTGACCCGAC GCTTGGCGCG CGATTTAGCG
GTTGTCGGTA TCACGATTGT CAGCGGCTTA GCCTTGGGTG TCGACACAAT TGCCCATACA
GCGGCCCTCG AAGCCGGTGG GCGTACACTG GCCGTGTTAG CCTGTGGCGC TGACCGCGTG
TATCCAGAAC GGAATCGCAT CCTCGCCGAG CAGATCGTTA CTGCCGGTGC CCTGATCAGC
GATTACCCGC TCGGCACGCC ACCGGCTCCG CTCAACTTTC CACCGCGTAA TCGGATTATT
GCCGGCTTAA GCCTGGCAAC ACTGGTTGTA GAAGCACGTG AAGACAGTGG TGCGCTCATC
ACTGTGCAAT TCGCCCTCGA TCAAGGGCGT GACGTGATGG CAGTTCCAGG GTCGATCTTC
AACCCGCTGA GTGCCGGTCC TCATCGGTTG ATCCGCGAGG GTGCGGCGAT TATCACCAGC
GCGCAGGATG TGCTGGGCGT GCTCAACCTC GAAGGACGTA GTGATCTCCA CGAACCACCG
CTTGAGTTAG CACTTACGGC TGAGGAAGAG GCCATCTACC GCGTGGTCGA AAGTGAACCG
CAGCATATTG ATGTCATTGG GCGTGCTGCC GGACAGGCAG CCGCAACAAC GGCAGCGGCA
TTAGCGTTAC TGGAACTGAA AGGACTTGTG CGGCAAGTCG CACCACTCTA CTATGCACGG
GGTCGCTGA
 
Protein sequence
MIDDTIRYYI GFNLTPGIGP LRLSRLIERC GSVAAAWHAD DATMVAAGLD ARSIANLRTA 
RQTLDLDVEV ARLRAHGITP LAITDPAYPP LLRMIAAPPP LLYVRGTMTA SDQRAIAIVG
TRHPTPYGRE VTRRLARDLA VVGITIVSGL ALGVDTIAHT AALEAGGRTL AVLACGADRV
YPERNRILAE QIVTAGALIS DYPLGTPPAP LNFPPRNRII AGLSLATLVV EAREDSGALI
TVQFALDQGR DVMAVPGSIF NPLSAGPHRL IREGAAIITS AQDVLGVLNL EGRSDLHEPP
LELALTAEEE AIYRVVESEP QHIDVIGRAA GQAAATTAAA LALLELKGLV RQVAPLYYAR
GR