Gene Daud_1297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1297 
Symbol 
ID6026455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1372132 
End bp1373892 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content70% 
IMG OID641594114 
Productfibronectin-binding A domain-containing protein 
Protein accessionYP_001717440 
Protein GI169831458 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTACG ACGGGCTGGT GCTCGCGGCG GTCTGCCGCG AACTGGAAGC GAAACTGGCC 
GGCAGCCGGA TCCAGCGGGT GCAGCAGCCG GAAAAGCTGA CCCTGGTGCT GCAGTTCCGC
ACCCCGGGCA TGACCTACCA CCTGCTGCTC TCCGCCCACC CGCAGCAGGC GCGGGTGCAC
CTGACCGGGG AGCGGCTGGA GAACCCCCTC TCGCCCCCGC TCTTTTGCAG CGTGTGCCGC
AAGCACCTGG AGGGCGGGCG GGTGGAGGCC TTTATCCAGC CCGGCTTTGA ACGGGTGCTG
CAGCTCGCCG TCCTGTCCCG GGACGAACTG GGGCGGGAAA GCAAAAAGCT CCTGATCGCC
GAGATCATGG GCCGCCACAG CAACCTGGTG CTGGTCGACG CCGAATCCGG ACTGATCCTG
GACGCCGCCA AGCGCTACAC CCACGCGGTG AGCCGCTACC GGGAGGTCCT GCCGGGCAAG
CCCTACCTGG CCCCGCCCCG GGAGAAGGCC TCCCCCCCAG GGCTTGCGGC CGAGGAATTC
GGCCAGTTGC TGCTGGAGAT GCCCGTGCAC CTTCCGGTCT GGGAGGCGCT GCAGCGGCGG
TTCGAGGGTT TGAGCCCCCT GATGGCCCGC GAGGTGGTGC ACCGGAGCGG GCTGGACACC
GAACTGACCC TGGACTTCTG CGGCGAACAC GAGCTGGTCG CCCTCTGGGA GGCCTGCACC
CGCCTTTTCA CCCGGGCCGG ACAGGGCCTT TTCGAGCCGA CTGTTCTCCT CGGCCCCGGG
GGCGCGGCGG TGGACTTCGC CGCTTTCAGC ATCGGGCACC GGGCGAGCCG GACCGAGAAC
GAGGGAATGA ACGCCCTGGT GGACCGGGTG TCCCGGGTGC GGGCGGGCGG CGAGCGGGTG
AAGGGCCTCC GCGAGGGCCT TTTCGGCGCG GTGTCCCGGG CGCGCAAGCG CCTGGAGAAG
CGCCTGGAGG GGTGCCACGA AACGGCCGCC GACGGGGCGA AGGCGGAAGA GTTGCGGCTG
CTCGGGGAGG TGCTGACCGC CAACTTGTAC CGGCTGGACG GGAACGCCGA GCGGGTCATA
CTGGAGGACT TCTATACCGG GCGGCCGGTG GAGATCACCC TGGATCCGCG GCTCTCTCCC
GCCCAGAACG CGCAGGCTTA CTTCAAGCGG TACAGCAAGC TGCGTAAGGG TGCCGAGCGG
GCCCGCGCGG AACTGGCCGA GCTGGAGGCC GAGGCGGCCT ACCTGGACGC GCTAGAGACG
GCCATCGCCC TGGCCGAGAC CCCGGCGGAC CTGGAGGAGG TCCGGGAGGA ACTGGCCGGC
GCCGGCTATC TCCGTGACAA GGCCGGGGCC AGGCGGCCCA GGAAGGCGGC GGAACCCCGG
CCCCTGGAAC TGCAGACGGC CGACGGGGCC GTGGTGTACG TCGGCCGGAA CAACCGCCAG
AACGACCTGG TCACCTTCAA GATCGGGCGG CCGGACGACA TCTGGCTGCA CACCAAGAAC
ATCCCGGGGG CGCACGTCAT CATCCGCACC GGCGGGCGGC CGGTGAGCGA CCGGACCCTC
CTGGAGGCGG CCTCCTGGGC GGCTTACTTC AGCAAGGCCC GCCAGGGAAA AAAGGTGCCG
GTGGATTACA CCCTGCGCAA ACACGTGCAA AAGCCCAAGG GCGCACGCCC CGGGTTCGTA
ATCTACACCC ACGAGAAAAC GGTGCTGGTG GACCCACGAC CGGTGTGTTT ACCTGGGCAG
GAGGAGTCTG GGGGGCGGTA G
 
Protein sequence
MPYDGLVLAA VCRELEAKLA GSRIQRVQQP EKLTLVLQFR TPGMTYHLLL SAHPQQARVH 
LTGERLENPL SPPLFCSVCR KHLEGGRVEA FIQPGFERVL QLAVLSRDEL GRESKKLLIA
EIMGRHSNLV LVDAESGLIL DAAKRYTHAV SRYREVLPGK PYLAPPREKA SPPGLAAEEF
GQLLLEMPVH LPVWEALQRR FEGLSPLMAR EVVHRSGLDT ELTLDFCGEH ELVALWEACT
RLFTRAGQGL FEPTVLLGPG GAAVDFAAFS IGHRASRTEN EGMNALVDRV SRVRAGGERV
KGLREGLFGA VSRARKRLEK RLEGCHETAA DGAKAEELRL LGEVLTANLY RLDGNAERVI
LEDFYTGRPV EITLDPRLSP AQNAQAYFKR YSKLRKGAER ARAELAELEA EAAYLDALET
AIALAETPAD LEEVREELAG AGYLRDKAGA RRPRKAAEPR PLELQTADGA VVYVGRNNRQ
NDLVTFKIGR PDDIWLHTKN IPGAHVIIRT GGRPVSDRTL LEAASWAAYF SKARQGKKVP
VDYTLRKHVQ KPKGARPGFV IYTHEKTVLV DPRPVCLPGQ EESGGR