Gene Sros_4159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4159 
Symbol 
ID8667453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4627894 
End bp4629234 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content72% 
IMG OID 
Productsodium/hydrogen exchanger 
Protein accessionYP_003339806 
Protein GI271965610 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.286241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCCG TGACCTCGCC GGTCGGCCTG ATCGGCCCGG ACGAGTTACT GATCTTCCTG 
ATGCAGGTCG GGCTGCTGCT GCTCTGCGCG CTGCTGCTCG GACGGCTGAC CCAACGACTG
GGGATGTCGG CCATCGTCGG CGAGCTGTGC GCGGGCGTGC TGCTCGGCCC CTCGGTGCTG
GCGCGGGTGG CGCCGGGACT GAGCGACTGG CTGCTGCCCC GCGACACCGC CCAGTTCCAC
ATGCTCGACG CGGTCGGCCA GGTCGGCGTG CTGCTGCTCG TCGGCATCAC CGGCATCCAC
ATCGACCTGC GGCTGGTACG CCGCCGGGGA GTCACCGCGG CCCGGATCAG CCTCGCCGGT
GTGGTCATCC CCCTCGCGCT CGGCGTGGCC GTCGGTCTCC TGCTGCCGGA CTCGCTCGTC
CCCGGCTCCG CCGACCGCAG CACGTTCGCG CTGTTCCTGG GGGTGGCGAT GGGCGTCAGC
GCCATCCCGG TCATCGCCAA GACGCTGATG GAGATGCGCC TGCTGCACCG CAACATCGGG
CAGCTGATCC TGTGCGCGGT CACGGTCGAC GACATCGTCG GCTGGCTGCT GCTGTCCGTC
GTCTCCGCCA TGGCGACCAC GGGGGTACGG GCGGGGAACA TCGCCCTCTC CGTCGGCTAC
GTGGCCGCCG TCGTCGCCGT CGCCGTACTC GCCCGCCCGC TCGTACGGAC CGCGCTGCGG
GCGGCGGAGC GCTCCGACAG CAGGAGGGAC GGCGGCGTGA CGGTCACGCT GGTGGTCATC
CTGGTGGTGC TGGCGGCGGC CGCGACGCAG GCCATGAAGC TGGAGGCGGT GTTCGGCGCC
TTCGTGTGCG GGATCGTGAT CAGCAGTTGC GGCACGCTGA ACCCCGCCCG GCTGGCGCCG
CTGCGCACGA CGGTCCTGTC CTTCCTCGCC CCGCTGTTCT TCGCGACCGC CGGGCTGCGG
ATGGACCTCG CCGCCCTGAC CCAGCCGACG GTCCTGCTGG CCGGCGTCGG CGTGCTGCTC
ACCGCGATCC TGGGCAAGTT CGCGGGAGCC TATCTCGGGG CCCGGCTCAG CCGGCTCGGC
CACTGGGAGG CCCTCGCGCT CGGCGCGGGC ATGAACGCCC GCGGCGTCAT CGAGGTCATC
ATCGCGATGG TCGGTCTCCG GCTGGGCGTG CTGAGCGCCG AGATGTACAC GGTCATCGTG
CTGGTGGCCA TCGTGACGTC GCTGATGGCG CCGCCTATCC TGCGCCTGAC GATGGCACGG
GTCGAGCAGA CCGCCGAGGA GCGGCTGCGC GAGGAGAGCC ACGCGGTCGA AAAGGTCCGC
AACACAGGCG GGACCACCTG A
 
Protein sequence
MTPVTSPVGL IGPDELLIFL MQVGLLLLCA LLLGRLTQRL GMSAIVGELC AGVLLGPSVL 
ARVAPGLSDW LLPRDTAQFH MLDAVGQVGV LLLVGITGIH IDLRLVRRRG VTAARISLAG
VVIPLALGVA VGLLLPDSLV PGSADRSTFA LFLGVAMGVS AIPVIAKTLM EMRLLHRNIG
QLILCAVTVD DIVGWLLLSV VSAMATTGVR AGNIALSVGY VAAVVAVAVL ARPLVRTALR
AAERSDSRRD GGVTVTLVVI LVVLAAAATQ AMKLEAVFGA FVCGIVISSC GTLNPARLAP
LRTTVLSFLA PLFFATAGLR MDLAALTQPT VLLAGVGVLL TAILGKFAGA YLGARLSRLG
HWEALALGAG MNARGVIEVI IAMVGLRLGV LSAEMYTVIV LVAIVTSLMA PPILRLTMAR
VEQTAEERLR EESHAVEKVR NTGGTT