Gene Acid345_3341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3341 
Symbol 
ID4071259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3963700 
End bp3964857 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content59% 
IMG OID637985363 
ProductDNA processing protein DprA, putative 
Protein accessionYP_592416 
Protein GI94970368 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGTCG TTGAACCCAC CACTTCCAAC CGCCAGCGCC AATGGCTGGC GCTCTCGCTC 
ACTCCACAAC TCGGTCCCAC GCGCGGACGG CGCCTCGTCG AACACTTCGG CGATGTCGAA
AGAATCTTCC GCGCCTCTCT GACGGAACTT GAAGCTTCTG GATTGCCCGC AGCCTCCGCG
CAATCGATCG CACTGGGAAA GTCGTACGAA CTGGCGGAAG ACGAGATGGT GAAGACCGCG
CAGGCCGGCG CGAAGATCGT AGCCATTGGC GATCCTGAAT ATCCGCCGCG ATTGATGGAG
ATTTACGATC CGCCACTGGC GCTGCGGGTG CGCGGTGATG TGAGCATCCT GAGCAAGCCG
GGGATGGCAG TGGTAGGCAC CCGGCACCCC ACGCCCTATG GCCTGGGGAT GGCCGAGCGG
CTCTCCTGCG ACCTTGCGGC ACGTGGCGTC ATCATATTAA GTGGGTTGGC GCGCGGAGTG
GATACCGCTG CCCACCGTGG GACGATAAAT GCCCGCGGAA AGACCGTCGC AGTCTTCGGC
ACCGGCGTGG ATGAAATCTA TCCGCGCGCG AACAAGAAGC TGGCCGAACA GATCGTCGAA
TTTGGCGGCG CATTGATCAG TGAGTTCCCG ACGGGAACAT TTCCGGCGCC TCAGAATTTC
CCTATCCGCA ACCGGATCAT CAGCGGGCTC TCGGTCGGTG TACTCGTGAT CGAAGCCGGT
GAATACAGCG GTACGCGTAT CACCGCGCGT TGCGCGCTGG AGCAGTGTCG CGAGGTGTTC
GCAGTCCCCG GCAATGTGAC CAACAAGCTT TCGTGGGGGC CGAATACACT CATCAAGCAG
GGCGCGAAGC TCGTTGCCAC GTGGGAAGAT GTGTGGGAAG AACTGGGCAG CGACATCCGT
TTGCAGATAC CGCCGCCTGA GGCGCTTGCG ACCGAAACGC CCCAGGCAGC ATCTCTATTC
GACCAGCACG AAATGCCGGC TCAAGAACGC AAAGTGTACG CGTTACTGCG TGCGGACGAA
TCGATGCACA TTGATGAATT GATCGAAAAG CTGGATGGAC GGCTGTCTTC GGCAGAGATA
TTTTCGGCAT TATTTGAACT GGAAATGGCG AGCAAAATCA GGCAGATGCC CGGAAAATAC
TACGTTCGTA GCATGTAG
 
Protein sequence
MTVVEPTTSN RQRQWLALSL TPQLGPTRGR RLVEHFGDVE RIFRASLTEL EASGLPAASA 
QSIALGKSYE LAEDEMVKTA QAGAKIVAIG DPEYPPRLME IYDPPLALRV RGDVSILSKP
GMAVVGTRHP TPYGLGMAER LSCDLAARGV IILSGLARGV DTAAHRGTIN ARGKTVAVFG
TGVDEIYPRA NKKLAEQIVE FGGALISEFP TGTFPAPQNF PIRNRIISGL SVGVLVIEAG
EYSGTRITAR CALEQCREVF AVPGNVTNKL SWGPNTLIKQ GAKLVATWED VWEELGSDIR
LQIPPPEALA TETPQAASLF DQHEMPAQER KVYALLRADE SMHIDELIEK LDGRLSSAEI
FSALFELEMA SKIRQMPGKY YVRSM