Gene Sare_2743 details

Gene Information

Locus tag: Sare_2743
Symbol:
ID: 5705726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3124501
End bp: 3125529
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 75%
IMG OID: 641272199
Product: transport system permease protein
Protein accession: YP_001537569
Protein GI: 159038316
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4779] ABC-type enterobactin transport system, permease component
TIGRFAM ID:
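
The start, end, gene length, and protein length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal sketch, not part of the record: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted as a residue. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(3124501, 3125529)
print(length, protein_length_aa(length))  # prints "1029 342"
```

Under these assumptions the computed values match the Gene Length (1029 bp) and Protein Length (342 aa) fields.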


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00380134
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0111876
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGTCG TACGTACCCC CGGCGGTGTA TCGCTGCGGC TGCGCCCCCG CTCGCTCGCC 
GTCGGCGTGA CCTGCGTGCT GCTCACCCTG GCCGTGGGTG TGCTGGCGCT GGGCAGCGGC
GACTACCCGA TGAGCGCCGC CGACGTGCTC CGCACGCTGA CCGGCGGCGG TACCGCCGCG
GAGGAATTCG TCGTCCACGA ACTGCGGCTG CCCCGCCTCG CCACCGCCAT CGCGGTCGGT
GCCGCGCTGG CCCTGGCCGG CGCGGTCTTC CAGACCCTGG TCCGCAATCC CCTCGGCAGT
CCCGACCTGC TCGGCTTCAC CCAGGGCGCG GCCACCGGCG CGCTCGTGGT GATCGTCGTC
GGTGGCACCA GCGCGATGCT CTCCGGCGCC GCCGCCGTCA GCGGATTCGC CACCGGCCTG
CTGGTGTACG TGATCGCCTG GCGACGCGGC GTGCACGGCT ACCGGCTCGT GCTGGCCGGC
ATCGGGGTCG CCGCCATCCT CACCGGGGTC AACGGGTGGC TGCTCACCCG CGCCCCGCTG
ATGGATGCCG CCCGGGCCGT TCTCTGGCTC ACCGGCAGTC TCGACGGCCG GGGCTGGACA
CACGCCCTAC CCGTCCTGGT CGCCCTCGCC GTGCTCGGGC CGGCGGTGCT GGCCGGCGCG
GGTCCGGCGC TGCGGCTCAT GGAGATGGGG GACGACGCCG CCAGCGCGCT CGGCGTGCCG
GTGCAGCGGC TGCGGCTGGC GCTGCTCGGG GCGGCCGTGC TGCTGGTCTC CCTCGCCTCG
GCCGCCGCCG GCCCGGTCAA CTTCGTGGCG CTCACCGCGC CGCACCTGGC CCGGCGGCTC
ACCCGCGCGC CCGGCCCGAA CCTGCTGCCC TCGGCGCTGC TCGGGGCGCT GCTGTTGGTC
GTCGCCGACC AGGTCGCCCA ACGCGCCATC CCGGGCCAGC AGCTGCCGGT GGGCGTGGTA
ACCGGTTTAC TGGGCGGTGG GTACCTGATC TGGCTACTGG CAGCCGAGCG TCGGGCGGGC
CGGCTGTGA
 
Protein sequence
MIVVRTPGGV SLRLRPRSLA VGVTCVLLTL AVGVLALGSG DYPMSAADVL RTLTGGGTAA 
EEFVVHELRL PRLATAIAVG AALALAGAVF QTLVRNPLGS PDLLGFTQGA ATGALVVIVV
GGTSAMLSGA AAVSGFATGL LVYVIAWRRG VHGYRLVLAG IGVAAILTGV NGWLLTRAPL
MDAARAVLWL TGSLDGRGWT HALPVLVALA VLGPAVLAGA GPALRLMEMG DDAASALGVP
VQRLRLALLG AAVLLVSLAS AAAGPVNFVA LTAPHLARRL TRAPGPNLLP SALLGALLLV
VADQVAQRAI PGQQLPVGVV TGLLGGGYLI WLLAAERRAG RL
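
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the DNA into protein. It is an illustration under stated assumptions, not the database's own method: `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helper names, and the codon table is the standard amino acid assignment, which translation table 11 shares. Table 11 also allows alternative start codons, which this sketch does not handle; the gene here starts with ATG, so that does not matter for this record. Only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) paired with the
# standard amino acid assignments, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record above.
first_line = "ATGATCGTCG TACGTACCCC CGGCGGTGTA TCGCTGCGGC TGCGCCCCCG CTCGCTCGCC"
dna = clean_sequence(first_line)
print(len(dna))        # prints 60
print(translate(dna))  # prints MIVVRTPGGVSLRLRPRSLA
```

The translated fragment matches the first twenty residues of the protein sequence above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.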