Gene Daud_0401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_0401 
Symbol 
ID6026435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp444535 
End bp445842 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content62% 
IMG OID641593245 
ProductMcrBC 5-methylcytosine restriction system component-like protein 
Protein accessionYP_001716583 
Protein GI169830601 
COG category[V] Defense mechanisms 
COG ID[COG4268] McrBC 5-methylcytosine restriction system component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.204424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTTG TGCCGGTAAT GCTTGGGGAA TGGGAAGAGC TGTCACCCAG CGGGGACAGC 
CCGACGAGGG GTCTGTCTTT CCGGCGGGAG CCGGCTGCCC GCGCATTGGC GGCCGACTTG
GCCGAATCCG GGAAATTAGA GATCCGGGAA TTACTGAACG GCCTGGCCAT TCAGTCCAAG
TCCTTTGTGG GCACCATCCG TCTCGGTCCG CTGCAAGTGA CCATCCGGCC CAAGATGACG
GGTTTTCCGC TGGTTGCCCT GCTCCGCTAC GCCTACGGAT TGCGCAATCT CTTTCTTTAT
GGACAGGTTG AGATGGAAAC AACGGATCGG CCTTTTCAAG ATTTGCTCCT GTCCCAATTG
TCCGCCGAGG CGGCCGAATT ACTTTCCCGG GGTCTGCACC GCGCCTACCG GCCGCGACAT
GAATTAATGG CCAGCCCGCG CGGCCGGGTG AACTTTCAAC GGTTGGCGCG TACGGGAGGC
GTCCGACAGT CGGCATTGCC TTGCTATCAT CATCTCCGGC TCGCGGATTG TCTGTCCAAC
CAGGTGCTGG TGGCCGGCCT GCGCTTCGGC GCCGGGTTGA CGGCCGATCT CGAATTGCGA
GCGCGTCTCC GCCGGTTGGC GGCGGTATGC GGTGAGAACG TCACCCCGAT CCGTCTGGAT
TACCACGTCT TCGCCCGGCT CGAACGGGAG GCCAATCGGC TGACCCGCGC CTACGAACCG
GCTTTTCGGC TGACCAAGAT CCTGTACCGG GACGCCGGTG CGGGTTTGGG CCGGGAGGCG
GGTGGACTTC CAGTTCCGGG ATTCTTGTTT GATATGAACC GGTTTTTCCA GGCCGTCCTG
TCCCGTTTCC TGCACGAGAA TCTGGATGGT TTCCGGGTGC AGGATGAGTA CCGGCTGCAA
GGCATGTTCG CCTACGTTCC CGGTTTTAAT CCGCAGCGCA GGCAGGCACC GGCCCCGCGC
CCGGACTTCG TGGTTTTCCG CGGCGGCAGG GTAGCGGCGA TTCTGGACGC CAAGTACCGG
GATCTCTGGG AAAATGCGCT GCCCCGGGAT ATGCTCTACC AGTTGGCGCT GTATGCGTTG
AGCCAGGGCG GGGGCATGCG GGCCGCTATT CTTTATCCCA CTCTTGACCC CCGGGCGTGT
GAGGCGGTAA TCGAGGTGCG GGAGCCGGTT CACGGTATGG GACGGGCGCA GGTGATCCTA
CGCCCGGTGG TTATTGATGA ATTGGCGGAG ATGGTATCCC TGTCCGATCC GGCAACTGCA
AGAAGGCGGA AAGAATACGC CCGTCATTTG GCCTTCGGCG AAAAATGA
 
Protein sequence
MTVVPVMLGE WEELSPSGDS PTRGLSFRRE PAARALAADL AESGKLEIRE LLNGLAIQSK 
SFVGTIRLGP LQVTIRPKMT GFPLVALLRY AYGLRNLFLY GQVEMETTDR PFQDLLLSQL
SAEAAELLSR GLHRAYRPRH ELMASPRGRV NFQRLARTGG VRQSALPCYH HLRLADCLSN
QVLVAGLRFG AGLTADLELR ARLRRLAAVC GENVTPIRLD YHVFARLERE ANRLTRAYEP
AFRLTKILYR DAGAGLGREA GGLPVPGFLF DMNRFFQAVL SRFLHENLDG FRVQDEYRLQ
GMFAYVPGFN PQRRQAPAPR PDFVVFRGGR VAAILDAKYR DLWENALPRD MLYQLALYAL
SQGGGMRAAI LYPTLDPRAC EAVIEVREPV HGMGRAQVIL RPVVIDELAE MVSLSDPATA
RRRKEYARHL AFGEK