Gene Sros_2434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2434 
Symbol 
ID8665720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2654449 
End bp2655789 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content68% 
IMG OID 
Productamino acid transporter 
Protein accessionYP_003338155 
Protein GI271963959 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0159957 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCCC CCACTCCCGA GGTCGGACAG GTCCCTCTCA AGCGGGCCAT CGGCCCCAAA 
CTCCTGCTGC TCTTCATCGT CGGTGACATC CTCGGCACCG GCGTCTACGC GCTCACCGGC
AAGGTGGCGG GCAAAGTCGG CGGAGCGCTG TGGATACCGT TCCTGGTCGG GTTCGTCATC
GCCGCGCTGA CGGCGATGTC CTACGTCGAG CTCGTGACGA AGTATCCGAG AGCGGCCGGC
GCGGCCCTCT ACACCCAGCG CGCCTTCCGG ACGCCGTTCC TGACGTTCAT GGTGGCGTTC
ACGGTCATGT GCTCGGGCCT CACCTCGGCC AGCGCGGCGG CGCGCGCCAT CGGAGGCGAC
TACCTGAAGA CGTTCGTCAC GGTGCCCGGC GTCATCGTCG GGATCGTCTT CATCGTGGCG
ATCGCGCTGC TGAACTATCG CGGCGTCTCG GAGTCGGTGA AGACCAACAT CGTGTTCACG
ATCATCGAGC TGACCGGGCT CCTGGTGATC ATCGTCATCG GCGTGTACGC GGTCGCCACC
GGCGCGGGGG AGCCCGCCCG GCTGACCGAG ATCCGCGCCG ACCAGGAAGG CGGCCTGTTC
CTCGCGCTGC TCGGAAGCAC GGCCCTGGCC TTCTTCGCGT TCGTCGGGTT CGAGGACTCG
GTGAACATGG CCGAGGAGAC CCGGGATCCC TCGCGCAACT TCCCCCGGGC GATCTTCCTC
GGTGTGGCGA TCACCAGCAT GATCTACATT CTCGTCGCCG TCACCTCCTC GCTGCTGGTC
GACCACCGGG TCCTGGAGAA GTCGTCGGGG CCCCTGCTGG AGGTGGTCAA GGCCGGTGGG
ATCAGCTTCC CGCCCCAGCT GTTCGCGGTG ATCGCCATGT TCGCCGTGGC GAACTCCGCG
CTGATCAACA TGATGATGGC CTCCCGGCTG GTGTACGGCC TGGCCAACGA ACGGGTCGTG
CCGCGCGCCC TGGGCCGGGT CGATCCGCGC CGCCGTACAC CGGTGGTCGG CATCATCTTC
ACCACGGCGC TCGCCATCGC CCTGATCTCG ACGGGAGATA TCGCGGGGCT CGGCGACACC
ACCGCCTTCC TGCTGCTGTG CGTCTTCACC ATCGTCAACG TGGCCGTCCT CGTGCTCCGC
AAGGACACCG TGGACCACGC GCACTACCGG GCGCCGACGA TCCTGCCCGT GCTCGGCGCC
GTCCTGGCCT TCGTCCTGGC GAGTCCCCTG ACGGGCCGGC CCGCGGAGGT CTACATCCGG
GCGGGCGTCC TGCTGGGCAT CGGCCTCCTG CTGTGGGTGG TCAACTGGCT GGTGACCCGC
CGCCAGGCCC CCGCCGAGTG A
 
Protein sequence
MTSPTPEVGQ VPLKRAIGPK LLLLFIVGDI LGTGVYALTG KVAGKVGGAL WIPFLVGFVI 
AALTAMSYVE LVTKYPRAAG AALYTQRAFR TPFLTFMVAF TVMCSGLTSA SAAARAIGGD
YLKTFVTVPG VIVGIVFIVA IALLNYRGVS ESVKTNIVFT IIELTGLLVI IVIGVYAVAT
GAGEPARLTE IRADQEGGLF LALLGSTALA FFAFVGFEDS VNMAEETRDP SRNFPRAIFL
GVAITSMIYI LVAVTSSLLV DHRVLEKSSG PLLEVVKAGG ISFPPQLFAV IAMFAVANSA
LINMMMASRL VYGLANERVV PRALGRVDPR RRTPVVGIIF TTALAIALIS TGDIAGLGDT
TAFLLLCVFT IVNVAVLVLR KDTVDHAHYR APTILPVLGA VLAFVLASPL TGRPAEVYIR
AGVLLGIGLL LWVVNWLVTR RQAPAE