Gene Strop_4039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4039 
Symbol 
ID5060521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4590323 
End bp4591528 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content71% 
IMG OID640476300 
Producttype II secretion system protein E 
Protein accessionYP_001160847 
Protein GI145596550 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGG CGGACGACGG CCTCGCCGCG AGAGTCCGAC AGCGAATCGC GTCCTCGTCA 
ACTCCGGTCA CCCCGGCGGC GATCGTCTCC GCCGTGCGCG CCGAGCCGGC CGCCGCGGTG
CTCGGCGACA CCACGCTGCT CCGCCTCGCC GACCGAGTGC ACGACGACCT GGTGGGCGCC
GGTCCGTTGG CCCCGCTGTT GGTCGATCCA CGCGTCACCG ATGTCCTCGT CAACAACGTT
CACGTCTGGG TCGACCGAGG GGAGGGCCTG AAAGAGGTCG CGGTACCGGT GGGTTCGGTG
GACGACGTCC GGCGGCTCGC GCAGCGGCTC GCAGCCAGCG CGGGGAGGCG GCTGGACGAC
GGCTCGCCGT GCGTCGACGC GCGGCTCGCC GACGGCACCC GACTACACGC GGTCCTGCCA
CCAGTGGCCA CCGACGGTCC ATACCTGTCC CTGCGGACCT TCCGGCAGCG CCCGTTCACG
CTCGACGAGA TGGTTGGTCA GGGGGCCGTG CCACGGGCGG TCGCCCCGGT GCTGGCCGCG
GTGGTGGCGG CCCGGCTCGC GTACCTCGTG GTCGGCGGCA CCGGTTCCGG CAAGACCACA
TTGCTCAACA CGCTACTTGG CCTGGTTTCG GGCACGGAGC GGATCGTGCT GGTAGAGGAC
GCGGCTGAGC TGCACCCCGT ACATCCCCAT GTCGTCGGCC TCCAGGCACG TACAGCCAAT
GTGGAGGGTG TAGGTGCGGT GGGCCTGACC GACCTGGTCC GGCAGGCGCT GCGGATGCGA
CCGGACCGCC TCGTCGTCGG CGAGTGCCGC GGCCGGGAGA TCGTCGACCT GTTGGTGGCC
ATGAATACCG GTCACGAGGG TGGTGCCGGG ACGCTGCACG CCAACACCCC GTCGGATGTG
CCGGCTCGGC TCGAGGCACT CGGTCTCCTC GGTGGATTGC CCCGGCCCGC ACTGCACGCT
CAGGTGGCGG CGGCACTCCA GGTGGTGCTG CATGTCCGTC GCGCGGATCG GGGACGGGCA
CTGGATTCCA TCTGCCTACT TCTGCCGGAA GGCCCTGATC GGCTGGTCAG GGCGGTTCCA
GCCTGGGGCC TTGACAGCGG CCCCGGCCCG GCTGCCCGGA CGCTGGCCGA GTTACTGGGC
AGGCGGGAGG TGGCGGTGCC ACCGATCCTT CGGGGGCCCT GGCCCGGTCA GGCAGGTGTG
GGATGA
 
Protein sequence
MTKADDGLAA RVRQRIASSS TPVTPAAIVS AVRAEPAAAV LGDTTLLRLA DRVHDDLVGA 
GPLAPLLVDP RVTDVLVNNV HVWVDRGEGL KEVAVPVGSV DDVRRLAQRL AASAGRRLDD
GSPCVDARLA DGTRLHAVLP PVATDGPYLS LRTFRQRPFT LDEMVGQGAV PRAVAPVLAA
VVAARLAYLV VGGTGSGKTT LLNTLLGLVS GTERIVLVED AAELHPVHPH VVGLQARTAN
VEGVGAVGLT DLVRQALRMR PDRLVVGECR GREIVDLLVA MNTGHEGGAG TLHANTPSDV
PARLEALGLL GGLPRPALHA QVAAALQVVL HVRRADRGRA LDSICLLLPE GPDRLVRAVP
AWGLDSGPGP AARTLAELLG RREVAVPPIL RGPWPGQAGV G