Gene CPF_0103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0103 
Symbol 
ID4202213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp123542 
End bp124864 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content31% 
IMG OID638080984 
ProductGTP-binding protein 
Protein accessionYP_694567 
Protein GI110798691 
COG category[R] General function prediction only 
COG ID[COG1160] Predicted GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000228129 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATT TTAATGAAAC TCCTAGAGGA AGTAGAATTC ATATTTCTCT TTTTGGTAAA 
ACAAACTCTG GGAAATCTAG CATAATAAAT GCCCTAACAG GGCAAAACAT TTCTCTAGTA
TCAGACTTTA AGGGAACAAC AACTGACCCT GTTTATAAGG CAATGGAACT TTTACCACTA
GGACCTGTTG TTTTCGTTGA TACAGCTGGT TTTGATGATG AAGGAGAAAT AGGTAAGCTT
AGAGTTGAAA AAACTGAAGA GGTTGTAGGA AAGACTGATG TAGCTCTTAT AACCCTTTCC
CTTTCTGAAA TACTAGAGGC AATAAAATCA AATATAGAAT TTAAAGACAT GCTTTCTAAG
GAAATATTAT GGCTTAATAA ATTAAAAAAG GCTAAGAAAC CAGCTATACT AGTTATTAAC
AAGTGTGATT TAGTTCCTAA TAACCTAATT GAGTCTAAAA TTGATTTAAA GGATATTGAT
AAAACAACTT TATCTAATAA AGATTGCTTT GTTGATAGTA ATTTAAATAA TTCTTTAAAA
GAAATTGGTG AATTATTAGG AATACCTTGT GTTGCCATAA GTGCAAAAAA TAATTTAAAC
ATAAATGAAT TAAAAAAAGA ACTTGTCAAT GTATCACCCT CTTCAATAAC TGAAAGCCCA
ATAATAGGTG ATAAAATCAA AGCTGGAGAT AAAATTCTTT TAGTAGCTCC TCAAGATATA
CAAGCTCCTA AAGGAAGACT TATACTTCCT CAAGTTCAAG TATTAAGAGA TATATTAGAT
TATGGTGGAA TACCAACTAT GGTTACATTA GATAAATTAG ATGAAGGATT AAAAATCTTT
AACGGTAAGC CTGACTTAGT AATAACTGAC TCTCAAGTAT TTAAACAAGT TAATGCAAAA
TTAGATAGAA GTGTTCCTCT TACTTCCTTC TCAATACTTA TGGCTAGATA CAAGGGTGAT
TTAGATAAGT TCTATTCAGG AGCCAAAGCA ATAAAGAATC TTAAAGCAGG TGATAAAGTT
TTAATAGCAG AAGCTTGTAC TCACCATCAA TTAAAAGGTG ATATAGCAAG AGAAAAACTA
CCTACTTGGT TAGAAGAAAC TTGTCCTGGA ATAATAGTTC ATAATTGCTC TGGTAAGGAC
TTTCCTAAGA ATCTTAATGA ATATTCCCTT GTAATTCATT GTGGAGGATG CATGTTTAAC
AAAGCTGAAA TAATGAATAG AATAGGAATA TGTGATTACG CCTTAGTTCC TATAACAAAC
TTTGGTACAT CAATTGCAGA AATTAATAAT ATCTTAGACA GAGTAATGGA ACCCCTTAAG
TAA
 
Protein sequence
MSNFNETPRG SRIHISLFGK TNSGKSSIIN ALTGQNISLV SDFKGTTTDP VYKAMELLPL 
GPVVFVDTAG FDDEGEIGKL RVEKTEEVVG KTDVALITLS LSEILEAIKS NIEFKDMLSK
EILWLNKLKK AKKPAILVIN KCDLVPNNLI ESKIDLKDID KTTLSNKDCF VDSNLNNSLK
EIGELLGIPC VAISAKNNLN INELKKELVN VSPSSITESP IIGDKIKAGD KILLVAPQDI
QAPKGRLILP QVQVLRDILD YGGIPTMVTL DKLDEGLKIF NGKPDLVITD SQVFKQVNAK
LDRSVPLTSF SILMARYKGD LDKFYSGAKA IKNLKAGDKV LIAEACTHHQ LKGDIAREKL
PTWLEETCPG IIVHNCSGKD FPKNLNEYSL VIHCGGCMFN KAEIMNRIGI CDYALVPITN
FGTSIAEINN ILDRVMEPLK