Gene Cphy_2034 details

Gene Information

Locus tag: Cphy_2034
Symbol:
ID: 5743062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 2513265
End bp: 2514476
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 37%
IMG OID: 641293131
Product: basic membrane lipoprotein
Protein accession: YP_001559141
Protein GI: 160880173
COG category: [R] General function prediction only
COG ID: [COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein
TIGRFAM ID:
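
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the source database.

```python
# Values copied from the record above.
start_bp = 2513265
end_bp = 2514476

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon
# and therefore contributes no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under these assumptions the computed values agree with the listed gene length of 1212 bp and protein length of 403 aa.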


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAT TAGCAGCAGT GATTTTAGGT GTTACAATGC TTGCAACTGT ATTTACAGGA 
TGCGGGAAAA AAGCCAATGA AAATAATAAT AATAATTATA ATCCAAGTGG TGCGACTGTA
ACAGAAGGTG CCTCTTCAGA CGAAGCAAAC AAAGGTAACA ATGAAGGTAA AAAAGAAGCG
CTTAAGATTG CAATCGTTAG TAGCCCATCT GGTGTAGATG ATGGTAACTT TAATCAGGAT
AATTATACTG GTATCTTAGA TTTCATTAAA AAACATCCAG ATGCAACAGT AACTCCTGTA
AAAGAACCAA CAGGTGATAC AACAGCTGCA ATTCAGGCAG TTGCTGATAT CGTAGCAGAT
TATAATGTTA TTGTATGTTG TGGTTTCCAA TTTGCAGGTA TCGGTACTCT TGCACAAGAT
AACCCAAATG TTGATTTTAT CTTAGTGGAT GCATATCCAA CTGATGCAGA AGGTAATGAA
ATAACAGCTA AGAACATCTA TGCAATGCAG TTTGCTGAGC AGGAAAGCGG ATTTTTCGCT
GGTATGGCAG CTGCATTAGA ATCAAAAACA AAAAAAGTAG CTTTAGTTAG TGGTATCGCA
TATCCATCCA ACGTAAATTA TCAGTTTGGA TTTGAAAGTG GTGTTAATTA CGCTAACAAG
AAATATGATG CAGGTGTTAA ATTAATTCAA CTACCTTCTT ACGCAGGAAC CGATGTAACT
GGTGCTAATG TTGGTGGTAA CTATGTTGGT AGCTTCTCTG ATGAAGCAAC TGGTAAGGTT
GTAGCAAATG CCTTAATTAA AGAAGGTGTT GACGTAATCT TCGTATCCGC TGGTGGTTCT
GGAAATGGTG TATTTACGGC AGTTAAGGAA GCTACTAATG TTAAAGCAAT CGGTTGTGAC
GTTGACCAGT ACGATGATGG TGTTAATGGA AGTACTAATA TTATCTTAAC CTCTGTTTTA
AAGGTAATGT CTAAGAATGT TGAGAAGCAG TTGAATGCAG TTGTTGATGG AACATTTACT
GGACAGAATG CATTATTAAA AGCTGATACA GATTCTACCG GTTTTGTGGC AACCCAAGGA
AGACAGCAAA TGAGTGCTGA AACAGTCGAA AAAATAGAAG CTGCATATGA ATTAGTAAAG
AATGGTACCA TTGTTCCAGC AGCAAATTTT AACGGAATTA AGTCTGATAG TTTTCCAGGT
TTAGATAAAT AG
 
Protein sequence
MRKLAAVILG VTMLATVFTG CGKKANENNN NNYNPSGATV TEGASSDEAN KGNNEGKKEA 
LKIAIVSSPS GVDDGNFNQD NYTGILDFIK KHPDATVTPV KEPTGDTTAA IQAVADIVAD
YNVIVCCGFQ FAGIGTLAQD NPNVDFILVD AYPTDAEGNE ITAKNIYAMQ FAEQESGFFA
GMAAALESKT KKVALVSGIA YPSNVNYQFG FESGVNYANK KYDAGVKLIQ LPSYAGTDVT
GANVGGNYVG SFSDEATGKV VANALIKEGV DVIFVSAGGS GNGVFTAVKE ATNVKAIGCD
VDQYDDGVNG STNIILTSVL KVMSKNVEKQ LNAVVDGTFT GQNALLKADT DSTGFVATQG
RQQMSAETVE KIEAAYELVK NGTIVPAANF NGIKSDSFPG LDK
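
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence codon by codon. The sketch below is one way to do that, not the method used by the source database. It assumes translation table 11, which assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Illustrative helpers; the names are hypothetical and not part of the source record.
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bp, 20 codons).
first_line = "ATGAGAAAAT TAGCAGCAGT GATTTTAGGT GTTACAATGC TTGCAACTGT ATTTACAGGA"
print(translate(first_line))  # MRKLAAVILGVTMLATVFTG
```

The translated prefix matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content. This gene starts with ATG, so no special handling of alternative start codons is needed here.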