Gene GM21_1528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1528 
Symbol 
ID8136857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1791370 
End bp1792449 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content65% 
IMG OID644869140 
ProductDNA protecting protein DprA 
Protein accessionYP_003021342 
Protein GI253700153 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.42126e-31 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATCACT ACTACTGGTT CGCGCTCAAG TCGGCGCCGC TGGTGGGAAA CGTTACCTTC 
CTGCGGCTTT TGTCCCGCTT CGGGACGCCG GAGCGCGCGC TCAAGGCCTC CCTGCAAGAG
CTCTCGGAAG TGAAGGGAGT GAGCCCGCAG GCCGCGGCCT CCATAGCCGG TCACGACTAC
GCTCACCAGG CGTCGAGCGA ATGCGAGAAG GTGCGGGCCT GCGGCGTCGA CGTCATCGAC
ATCCTGTCGC ATCGCTACCC TCGCCTTTTG ATGGAGATCC CGGACCCTCC CCCCTATTTT
TATCTCAAGG GGGCGCTTTC CGGCAGCGAA ACCGCGGTCG CCATGGTCGG TTCCAGGCGG
GCTTCGCAGT ACGGCCTCTG CACCGCGACG AGGCTTGCGC GCGACCTGGC GCTGCAGGGA
ATCACCGTCG TCTCCGGGAT GGCGCGCGGC ATAGACACGG CGGCGCACTG GGGGGCGCTC
AAGGCGGGCG GCCGTAGCGT CGCCGTACTC GGTTGCGGAA TAGACCTCGT CTACCCGCCG
GAGAACGGTG CTCTCTACCA GGCTCTCGCC GATAACGGGG CGCTCATCAG CGAATTCCCC
ATGGGGACCG CGCCTTTGGC TGAGAACTTC CCCCGCCGCA ACAGGATCAT CAGCGCGCTG
TCGCGCGGCG TGCTGGTAGT CGAGGCGGGG GAGGCGAGCG GTTCGCTGAT CACGGCGCAT
TACGCGCTGG AGCAGGGGCG CGAGGTCTTC GCCGTCCCCG GCAACGTCTC GGTGAGCGGC
AGCAGGGGCG CTAACGGCTT GATCAAGGAA GGGGCGAAGC TGGTGGAGCG CGTGGAGGAT
ATCCTGGAGG AGCTAGGCCT GGAGCCGCAG GCGAACCTTC CCCTCCCGAA GCCCTCAAGC
TTCGAGCTCA CCCCGCAGGA GGCGGAGCTT TACGCCCTTC TTTGCCAGGG GGCGCTGCAG
ATCGACGACA TCATCGTGCA GAGCGCGTTG ACAGCAAGCG AGGTTTCCGC TACTTTACTT
CGCTTGGAAA TGAAGGGAGC CATAGTCCAA CTCCCGGGCA AGCGCTTCGC AGTTGCGTGA
 
Protein sequence
MDHYYWFALK SAPLVGNVTF LRLLSRFGTP ERALKASLQE LSEVKGVSPQ AAASIAGHDY 
AHQASSECEK VRACGVDVID ILSHRYPRLL MEIPDPPPYF YLKGALSGSE TAVAMVGSRR
ASQYGLCTAT RLARDLALQG ITVVSGMARG IDTAAHWGAL KAGGRSVAVL GCGIDLVYPP
ENGALYQALA DNGALISEFP MGTAPLAENF PRRNRIISAL SRGVLVVEAG EASGSLITAH
YALEQGREVF AVPGNVSVSG SRGANGLIKE GAKLVERVED ILEELGLEPQ ANLPLPKPSS
FELTPQEAEL YALLCQGALQ IDDIIVQSAL TASEVSATLL RLEMKGAIVQ LPGKRFAVA