Gene CPF_1900 details


Gene Information

Locus tag: CPF_1900
Symbol: spoVB
ID: 4200971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2136345
End bp: 2137874
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 27%
IMG OID: 638082769
Product: stage V sporulation protein B
Protein accession: YP_696333
Protein GI: 110799140
COG category: [V] Defense mechanisms
COG ID: [COG0534] Na+-driven multidrug efflux pump
TIGRFAM ID: [TIGR02900] stage V sporulation protein B
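
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; neither convention is stated in the record, and the helper names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2136345
end_bp = 2137874
gene_length_bp = 1530
protein_length_aa = 509


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)      # True
print(coding_codons(gene_length_bp) == protein_length_aa)   # True
```

Under these assumptions, the reported 1530 bp gene length and 509 aa protein length agree with each other and with the coordinates.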


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000710979
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGTAAAA ATGATTTCTA TAAAAACTCT TTCATGCTAA CAGCCTCTAA CTTAACAACT 
GGATTATTAG GTTTTATCTT TTCTATGTAC CTATCAAAGG TACTAGGCCC TGAAGGAATG
GGACTTTATG GTATTATAAT GCCTATTTAT AATTTATTTA TATCAATTAT GACAGCTGGT
ATAATTGCAT CCATTTCTAA GATAACTGCT GTATATTCAG CAAGAGATGA TTATAAAAAC
ATAATTCGAA CAATGAAAGT TGTGGCTATT TTTAATTTTA TTTGGTGTCT TATAATAGGA
ATATTTGTAT TTTTCTTATC ACCTATAATC GGACATTTTT GGGCTAAGGA TCCTAGAATA
ATAAAATCTA TAATGGTAAC TTGTCCTGCT ATGATATTTA TAGCACTTTC AAACATATTA
AAGGGATTTT TCTATGGAAC TTCAAAAATC ACTGTTCCTT CTTTTATAGA TATTTTAGAA
AAATCTTTAC GTATCTTTGT TTTAGCTATA TTAATTTTTA TATTTAAAGC TGAAACTTTA
GAGTCTTTAG TTACTTTAGC TTACTTAGCC TTATGTCTTG GGGAACTACA AAGTTTAATA
CTTTTATTTG GATATTTTAA ATACTCAATG AGTAAATTTC CAAAAACAAA TGCAAAAGGG
GAAAGCCGTG CTCAATTATT ATTTGATGTT TTAGTAACCT CAGTACCTTT ATGCTTAAAT
GGATTTTTAA TGAGTATTTT TAGCGTTATC TCAACACTTT TAGTTCCTAA ACGGCTTATA
GTTGCAGGAT TTACTTATTC CCAAGCACTT TCTCTTATAG GAAAATATTC TTCCATGGCA
ATGTCCATAG TAACTTTTCC TATAATTATA GTTTCTTCAA TAAACACTAT GCTAATACCT
GATTTATCTC AAACTTTAAG TAAAGGGAAT TATCTTTCAG CTACTAAAAG AATTAGAGAT
GTTATTAAAA TAGCTTTTTT AATAGGTATT TGTACCACAG TAATTGGACT ATGTGTTCCT
GACTCTTTAG GTAAATTATT CTTTGGAAGA GATGATTTAG GAGAATATAT AAGAATAACA
TCAGTAATGA TGCCAATAGT ATTTACTTCA AATACTATGT ATGGAATTTT AAATGGACTT
GGAAGACAAA ATGTAATTTT AAGAAATACT ATAATAACAG AAGTTTTAGA AGTTACATTG
TTATTTTTCT TAACTGCAAT ACCATCTATA AATATTTATG GTTATGCAAT AACTATGCTT
ATAATTTCAT CACTTTCCCT TTGTTTAAAC CTTTATGAAA TATATAAAAA TATAAATATA
GGTTTATCCT TATCAAACTT CTTAATATAT ATATTAACAG GAGTTTTAAC CTATATATGC
TTAAGTCCAC TTTCACTAAA GCTTTCTTTT ATTGATTTTA GGATTCAAGT TTTAGCTGTA
ACTTCTATAG CAGCTTCTAT ATTCATATTT TTAATAATAA AGGAGAAATT CTCATCAAGG
TTTAGAAAAA TCTCTTTAAA AAGCAGATAG
 
Protein sequence
MSKNDFYKNS FMLTASNLTT GLLGFIFSMY LSKVLGPEGM GLYGIIMPIY NLFISIMTAG 
IIASISKITA VYSARDDYKN IIRTMKVVAI FNFIWCLIIG IFVFFLSPII GHFWAKDPRI
IKSIMVTCPA MIFIALSNIL KGFFYGTSKI TVPSFIDILE KSLRIFVLAI LIFIFKAETL
ESLVTLAYLA LCLGELQSLI LLFGYFKYSM SKFPKTNAKG ESRAQLLFDV LVTSVPLCLN
GFLMSIFSVI STLLVPKRLI VAGFTYSQAL SLIGKYSSMA MSIVTFPIII VSSINTMLIP
DLSQTLSKGN YLSATKRIRD VIKIAFLIGI CTTVIGLCVP DSLGKLFFGR DDLGEYIRIT
SVMMPIVFTS NTMYGILNGL GRQNVILRNT IITEVLEVTL LFFLTAIPSI NIYGYAITML
IISSLSLCLN LYEIYKNINI GLSLSNFLIY ILTGVLTYIC LSPLSLKLSF IDFRIQVLAV
TSIAASIFIF LIIKEKFSSR FRKISLKSR
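
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, where TTG can act as an initiation codon and is then read as methionine. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table from the standard NCBI amino-acid string (the sense codons of table 11 match the standard code) and treats ATG, GTG and TTG as start codons, which is an assumed subset of the starts listed for table 11. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Amino acids in NCBI order (bases cycle T, C, A, G at each codon position).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS, reading an alternative start codon as M.

    Translation stops at the first stop codon; trailing bases that do
    not form a full codon are ignored.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence above.
print(translate_cds("TTGAGTAAAA ATGATTTCTA TAAAAACTCT"))  # MSKNDFYKNS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence through the same function is a quick way to check the whole record.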