Gene Cphy_1801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1801 
Symbol 
ID5743090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2219386 
End bp2221623 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content37% 
IMG OID641292898 
ProductABC transporter related 
Protein accessionYP_001558909 
Protein GI160879941 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.344909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACATG AATATATTGA GATAGTAGGT GCACGAGCAA ATAACTTAAA GAATGTTAGT 
TTAAAAATTC CAAAAAAGAA GATCACAATT TTTACTGGAG TCTCAGGTTC AGGAAAATCA
TCGATAGTAT TTGAAACAAT AGCTCAGGAA GCTGGTCGGC AATTGAATGA AACCTTTAGC
AAATTCGTTC AAGGTTTTTT ACCTAAGTAT GGTCATCCGG ATGTTGATGC AATTGAAAAT
CTATCGTTAG CGATTATTGT TGATCAAAAG AGGATAGGTG GAAACTCACG TTCCACACTC
GGTACAATTA CGGATATTAA TCCATTACTT AGGTTACTGT TTTCAAGAAT TGGACAGCCG
CATATAGGCC CATCCTCTTA TTTTTCTTTT AATGATCCAA ATGGAATGTG TAAGACTTGT
GAAGGTATTG GAAAGATTGT TACGCTAGAT CTTGATAAAG CGCTAGATAA AGAAAAATCG
TTAAATGAAG GAGCGATTTT GTTACCGGGT TATAAACCTG GATCATGGCA GTGGAAGATG
TATGCTTCAA CTGGATTTTT TGATTGTGAT AAAAAAATAA AGGATTATTC AGAAGAGGAA
TATGAGAAAC TTGTTTATTG TAAACCTGTA AAAATCAATT CGTCCATCAT GGAAGGAATA
AATACCACCT ATGCAGGACT TGTAGAGAAA TTTATAATGC AGAACATCAA AACAGAGTTT
GAAAAATCGG AGGCATCACA GAAGAAAATT GCTCCATATA CAATGGAAGA GCAGTGCCAT
GATTGCGGTG GTAAACGATA CAATGAACGT ACTTTATCTT CGAAAATTAT GGGATATACC
ATTTCAGATT TTACAGCTAT GCAGGTGGAT GAACTTCTGG AAGTAATTCA AAAAATTGAT
GACAATAAAA TAATACCTAT TATTAGGAAT TTGACGGAGC GATTGAATGA TTTAATTCAA
ATAGGGCTAG ATTATGTAAG CCTCGATAGA GAAACCTCTA CCTTATCCGG AGGCGAATCA
CAACGTGTAA AAATGGTCAA GCACCTTACA AGTAGCTTGA CGGATGCAAT TTATATTTTT
GATGAACCAA GCATTGGATT ACACCCAAGA GATGTTCACA GGCTTAATGA ACTGCTGGTA
AAACTTAGGG ATAAAGGCAA TACAGTGATT GTGGTTGAAC ATGACCCTGA TGTGATAAAA
GTCGCCGATT ATATCGTTGA TGTTGGTCCG AAAGCTGGTG TAAACGGCGG TAGAATCATG
TTTGAGGGTA GCTATAGTGA CCTTTTGAAT GCTAAAACAC TTACAGCAGA ATATATCGGA
AGGAGTCTAC CGATTAAGAG TAAGCCAAGG ACAAGCAAGG AATTCTTCGA AACGAAAAAG
AGTAGTTTAC ACAATTTAAA AAATGTTAGT TTAAGAATAC CGAAAGGTAT CTTCACTGTG
GTTACAGGGG TTGCTGGTTC TGGTAAGTCT ACGCTTGTTA ATGGTGTATT TGCGAAGGAA
TATAAAGATG CCATTATTAT CGACCAGTCC GCAGTTAGCG CAAATTTACG TTCCAATCCA
GCAACCTTTA CCGGAGTAAT GGATCATATA CGTAAATTAT TTGCCGATGA AAATAAGGTA
AGCGCTGGAT TATTCAGTTA TAATTCGGAG GGCGCATGTG AGGCTTGTAA AGGCCGAGGT
TTTATAGAAA CAGATCTTTC TTTTATGGAT TCCGTGGAAA CTATCTGTGA AGAGTGCGGT
GGCAAACGAT TTAAGCAAGA TGTTTTGGAG TACAAGTATA ATGCCAAGTC TATTGTTGAA
GTTCTTGATA TGACAGTAGC AGAAGCGGTT GACTTTTTTA CACAAAAGGA AATAATAAAT
AAGTTAAAGT ATATAGTCGA TGTTGGCTTG CATTATATGA CGCTGGGGCA GCCGTTAGAC
ACCCTTTCCG GTGGTGAGTG CCAACGTTTA AAGCTTGCCA AGGAATTAAG TAAAAAGGGT
AATATTTATA TCATGGATGA ACCTACAACT GGTTTGCATA TGTCCGATAT TACCAGCATT
TTGAACCTAA TTGATCATCT TGTTGACAAA GGTAATACCG TTATTGTTAT AGAACACAAC
CTCGATGTAA TTCGTAATGC TGACTGGATT ATAGATGTAG GCGTTGAAGG TGGCAGTAAA
GGTGGGCAAA TCCTTTATGA AGGAATACCT GGTAACCTTA TAAACTGCAA GGACTCTATC
ACTGCAAAAT ACTTGTAA
 
Protein sequence
MEHEYIEIVG ARANNLKNVS LKIPKKKITI FTGVSGSGKS SIVFETIAQE AGRQLNETFS 
KFVQGFLPKY GHPDVDAIEN LSLAIIVDQK RIGGNSRSTL GTITDINPLL RLLFSRIGQP
HIGPSSYFSF NDPNGMCKTC EGIGKIVTLD LDKALDKEKS LNEGAILLPG YKPGSWQWKM
YASTGFFDCD KKIKDYSEEE YEKLVYCKPV KINSSIMEGI NTTYAGLVEK FIMQNIKTEF
EKSEASQKKI APYTMEEQCH DCGGKRYNER TLSSKIMGYT ISDFTAMQVD ELLEVIQKID
DNKIIPIIRN LTERLNDLIQ IGLDYVSLDR ETSTLSGGES QRVKMVKHLT SSLTDAIYIF
DEPSIGLHPR DVHRLNELLV KLRDKGNTVI VVEHDPDVIK VADYIVDVGP KAGVNGGRIM
FEGSYSDLLN AKTLTAEYIG RSLPIKSKPR TSKEFFETKK SSLHNLKNVS LRIPKGIFTV
VTGVAGSGKS TLVNGVFAKE YKDAIIIDQS AVSANLRSNP ATFTGVMDHI RKLFADENKV
SAGLFSYNSE GACEACKGRG FIETDLSFMD SVETICEECG GKRFKQDVLE YKYNAKSIVE
VLDMTVAEAV DFFTQKEIIN KLKYIVDVGL HYMTLGQPLD TLSGGECQRL KLAKELSKKG
NIYIMDEPTT GLHMSDITSI LNLIDHLVDK GNTVIVIEHN LDVIRNADWI IDVGVEGGSK
GGQILYEGIP GNLINCKDSI TAKYL