Gene Sare_4420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4420 
Symbol 
ID5705522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4994475 
End bp4995467 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content74% 
IMG OID641273839 
Producthypothetical protein 
Protein accessionYP_001539188 
Protein GI159039935 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4965] Flp pilus assembly protein TadB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.204685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0060004 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCTCGA CACAGCTTCT GGTGCCCCTG CTCGTCGTCG CTCTGACAGT CGTCGTTGCT 
TGGCTGCTGG TAAGGGGCGC CCGACCGCGG GCCCAGACCG CCGGTGGCGT GCGTGGTGCC
GACGACCCAC TGGTCGCCTG GGTGGCCAGC CTGCGGCACG ATGGTCCGCC GTGCAACCGT
GGGCCTGGGC CGGAGCGGGA CCTGGCGTGG TCGAAGCTGG ATTCAGCGCT GCCCCGGCCA
CGAGGGCAGG ACCGTTCGGA AGGGCCATCG ACCCGGCGTG ATTCCCGACG CCCGGTCGGG
GCACCACACC CGGTGGTGGC TGATCGCCGG ATCAGCTCGG TGTCGGGTGC GCGGGCGGTG
GGGGCCAGGC CGACCCATGC CGGAGGCCCA CTGGTGACGG CGCCCCGGCG TAGCCTGCTG
GTGTCGGGTG TGCTGGCCGC CGCCGTGGGA GCCCTGCTGG GTGGTCCGGC CGCCGCCCTG
GTCGCCGGTG TGTACGGCAC GTTGGGCGTC CGGGCGCTGC TGAGGCGACG GACGGCCCGG
CATGCCGAAG GGCTCCGTCG TCGTCACCTT GACCAACTCT GTGACCTTGC CGCAGACCTG
CGGGCCGGAC TGCCGGTGGG CCAGGCCGCC CCGCTGCCGG CAGGTGGCGA GCCGGGAAGC
ACCCTGCTCC GGGCCGCGGT CCGGCTCGCG GACCGCACCG GGGCACCCCT CGCTGAGCTG
CTGGATCGGA TCGATGCGGA CGCCCGGGCC GCTGATCGGG GCCTGGCCGC GGCGGCGGCG
CAGGCGGCTG GGGCACGGGC GACCGCGTGG TTGCTCGCGG CCCTCCCGCT CGGTGGCATC
GGTCTGGGAT TTGGCATCGG TGTCGATCCG GTCGCGGTGC TCCTGCACTC GACGGCTGGT
GTCGCCTGCC TGCTCCTCGC CGTCGTGCTT CAGGTCGCCG GGCTGTTCTG GGCGGAGCGG
CTCACCGCGA TCCGTGGCTG GGACATCGGA TGA
 
Protein sequence
MTSTQLLVPL LVVALTVVVA WLLVRGARPR AQTAGGVRGA DDPLVAWVAS LRHDGPPCNR 
GPGPERDLAW SKLDSALPRP RGQDRSEGPS TRRDSRRPVG APHPVVADRR ISSVSGARAV
GARPTHAGGP LVTAPRRSLL VSGVLAAAVG ALLGGPAAAL VAGVYGTLGV RALLRRRTAR
HAEGLRRRHL DQLCDLAADL RAGLPVGQAA PLPAGGEPGS TLLRAAVRLA DRTGAPLAEL
LDRIDADARA ADRGLAAAAA QAAGARATAW LLAALPLGGI GLGFGIGVDP VAVLLHSTAG
VACLLLAVVL QVAGLFWAER LTAIRGWDIG