Gene Bpro_3371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_3371 
Symbol 
ID4013946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp3564726 
End bp3565751 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content70% 
IMG OID637943035 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_550179 
Protein GI91789227 
COG category[S] Function unknown 
COG ID[COG3181] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACTC GTCACCCAAC AACCCTCGCA GCCGGGGGCT GCCTCTCGCG CCGTGGCCTG 
CTCGTGCAGG CGGCAGCTGC CGCGCTGGCG GCGCCCTGGG TGGCCAACAG CCATGCGGCG
GCCAATGTGG CGGGCGGCCG GCCCATCACG CTGGTGGTCT CGTACCCCGC AGGCGGCGGC
GCAGACCTGA TTGCGCGCAT CATCGCGCCG CGCATGGCCG ATGCGCTGGG CCAGAGCGTG
GTCGTGGACA ACAAACCCGG CGCGAGCGGG CAGCTGGCGG CATCACAGGT AGCGCGCGCC
ACGGCGGACG GCACCACGCT GCTGCTGGAT GCATCGTCCT TCGCGGTGAA CCCGTCGCTG
TTTCCCAAGC TGCCGTACGA CAGCGCCAAG GCCTTCACGC CGCTGGCGGT GCTGGCCACC
TTTCCGAACG TGCTGGTGTG CACGCCCGGC TTTTCTGCCA GGTCGGTCAA GGACGTGATC
CAGCTGGCCA GGGCCCGGCC GGGCGAGGTG ACATATGCCT CGTCGGGAAA CGGCTCGGCG
CAACACCTGG CGGGCGCCAT GTTCGAAGGC CGCGCAGGCG TGCAGCTGCT GCACATTCCC
TACCGTGGCG GCGGCCCGGC CCTCAATGAC GTGATGGGCG GACAGGTGCC GCTGTTCTTC
GCCAACGTGG CGTCGTCGCT GGGGCATATC CAGGCGGGCA AGCTGCGGCC GCTGGCGGTG
ACCAGCGCGG TGCGAGCCCG CTCGCTGCCC GATGTGCCCA CCATGGCGGA GGCCGGATTG
GCCGGCTTCG AGGTGCTGGA GTGGAACCCG CTGCTGGCCC CTGCAGGCCT GCCGGCGGAC
GCAAAGGCCA CGCTGGTGGC CGCCATTCGC AAGGCACTGG CCGACCCCGA GGTGCTGGGC
CGCGTGCGCC AATTGGGCGG TGACGTGTTT GCCGATACCT CGCAGCAAAG CGCCGGCAAG
TTCATCGCGG CCCAGCAGGA ACAATGGGCG CGTGTGGTGC GCGAGCGCAA GATCTCGGTG
GGCTGA
 
Protein sequence
MNTRHPTTLA AGGCLSRRGL LVQAAAAALA APWVANSHAA ANVAGGRPIT LVVSYPAGGG 
ADLIARIIAP RMADALGQSV VVDNKPGASG QLAASQVARA TADGTTLLLD ASSFAVNPSL
FPKLPYDSAK AFTPLAVLAT FPNVLVCTPG FSARSVKDVI QLARARPGEV TYASSGNGSA
QHLAGAMFEG RAGVQLLHIP YRGGGPALND VMGGQVPLFF ANVASSLGHI QAGKLRPLAV
TSAVRARSLP DVPTMAEAGL AGFEVLEWNP LLAPAGLPAD AKATLVAAIR KALADPEVLG
RVRQLGGDVF ADTSQQSAGK FIAAQQEQWA RVVRERKISV G