Gene Sros_5901 details


Gene Information

Locus tag: Sros_5901
Symbol:
ID: 8669195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 6469262
End bp: 6470797
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: FHA domain-containing protein
Protein accession: YP_003341379
Protein GI: 271967183
COG category:
COG ID:
TIGRFAM ID:
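
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 6469262, 6470797
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1536 511
```

Both results agree with the Gene Length and Protein Length fields of the record.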


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0502803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.176709
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCACCCC CCCACACGCC GGTCTCGGCC GGTGATCTCA CTCCGGAGCT GCTCGGCACG 
CCCGAATGGA GCGATCAGCT GGTCGCAGCC GCCGGCATTC CCATGTACGA CATTCCGTTC
CTGACGGTCG GCGGTGGCAT GGGCTCCTTC GTGACCCTGG ACTACCTGCG GATCTACGGC
GTGCCCGCCT CCCAGATGCG GGTGGTGTCC AACATCGACA CCCCGTGGCA GACCTACGAG
TTCCTGACCA GGTGCTCGCA GATCCCCCGC TCCGAGCGGA TCCGCTCCGA CTCCGCCTCG
CGGCCCGACA ACATCTGGGG GTTCCCCTCC TATGCGCTGC AGGAGACCTG GACGGACAAG
ACGCCGGCCT ACCTGTGGCA GCTGCTGACC GAGCCGCTCC TGAACGACTA CTGGACGCCG
CGCGCCGGCA CGGTCTTCCA GAGCCTGGAG CGCGAGGCCA AGCGCATCGA CTACTGGGAC
ATGCTGGTCA AGGGCCAGGT CCGGATGGTC CGCCGGCGGG CCGGCGGCGG CTACTTCACC
GTGGTCACCC CGCCCGAGGG CTCGGCGCCC ACCAAGCGCA TCATCTTCCG CTCGCGCTTC
GTGCACATCG CGATCGGCTA CCCCGGCCTG AAGTTCCTGC CCGACCTGCA GGAGTTCCGT
ACCAAGCACG GCGACTACCA GCACGTGGTG AACGCCTACG AGCCGCACGA GCAGGTCTAC
GAGTTCCTCA AGACCCGTCC CGGCACGGTG GTCATCCGGG GCGGCGGCGT CGTGGCCTCC
CGCGTGCTGC AGCGCCTGTT CGACGACCGG GAGAAGTTCC GGCTGCAGAC CCAGATCGTC
CACATCTTCC GGACCTTCGT CACCGGCTCC CACGGCCCGC ACGTCTGGGC GCGGCGCAAG
GGCGGCGACG GCTGGGCCTA CCAGGGCTTC AACTATCCCA AGTCGGTGTG GGGAGGCCAG
CTCAAGGCGC AGATGCGCCG GCTGGAGGGC GCCGAGCGGG CCGCGAAGTA CAAGGAGATG
GGCGGCACCA ACACCCCCTA CCGCCGGCGC TGGCAGGAGC AGATGCGGGC GGGCCGCAGC
GGCGGCTACT ACCACCCCGT GCAGGGCACC GTGGACCGGG TGGAGCGCGG CCCCGACGGC
CGGCTGGTCA GCTACGTGCG CAGCAGCGAC GGCATCGTCC GCGAGCCGGT GGCCGACTAC
ATCATCGACT GCACCGGCCT TGAGGCCGAC ATCGCCGAGC ACCGGATCTA CGAGGACCTG
CTCCGGCACG GCGGGGCCTA CCGCAACCCG GTCGGCCGGC TGGAGGTGGA GCGCCACTTC
GAGGTGAAGG GGACGGCCAG CGGCGACGGC GTCCTCTACG CCTCCGGCTC GGCGACGCTC
GGCGGTTACT TCCCCGGCGT CGACACCTTC CTCGGCCTGC AGATCGCGGC CCAGGAGATC
GCCGACGACC TGGCACGGCG GGGGTTCGTC CGCAGGATGG GGCCGCTCCG GTCGACCTCG
CAGTGGTTCA AATGGGCCTT CAACTCGCCG GTGTAA
 
Protein sequence
MAPPHTPVSA GDLTPELLGT PEWSDQLVAA AGIPMYDIPF LTVGGGMGSF VTLDYLRIYG 
VPASQMRVVS NIDTPWQTYE FLTRCSQIPR SERIRSDSAS RPDNIWGFPS YALQETWTDK
TPAYLWQLLT EPLLNDYWTP RAGTVFQSLE REAKRIDYWD MLVKGQVRMV RRRAGGGYFT
VVTPPEGSAP TKRIIFRSRF VHIAIGYPGL KFLPDLQEFR TKHGDYQHVV NAYEPHEQVY
EFLKTRPGTV VIRGGGVVAS RVLQRLFDDR EKFRLQTQIV HIFRTFVTGS HGPHVWARRK
GGDGWAYQGF NYPKSVWGGQ LKAQMRRLEG AERAAKYKEM GGTNTPYRRR WQEQMRAGRS
GGYYHPVQGT VDRVERGPDG RLVSYVRSSD GIVREPVADY IIDCTGLEAD IAEHRIYEDL
LRHGGAYRNP VGRLEVERHF EVKGTASGDG VLYASGSATL GGYFPGVDTF LGLQIAAQEI
ADDLARRGFV RRMGPLRSTS QWFKWAFNSP V
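
The sequences above are laid out in blocks of ten characters, so the spacing has to be stripped before any computation. The sketch below shows one way to do that, compute a GC fraction, and check that a gene and protein sequence have compatible lengths. It assumes an unambiguous A/C/G/T alphabet and a single trailing stop codon. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def lengths_consistent(gene: str, protein: str) -> bool:
    """True if the gene has three bases per residue plus one stop codon."""
    return len(gene) == 3 * (len(protein) + 1)


# First line of the gene sequence above, as a short sample.
first_line = "TTGGCACCCC CCCACACGCC GGTCTCGGCC GGTGATCTCA CTCCGGAGCT GCTCGGCACG"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_fraction(fragment), 3))
```

Passing the full gene and protein sequences through `clean_sequence` and then `lengths_consistent` gives a quick check against the Gene Length and Protein Length fields. The gene begins with TTG, which translation table 11 allows as an alternative start codon, consistent with the protein beginning with M.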