Gene Sare_4011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4011 
Symbol 
ID5707433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4562720 
End bp4563862 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content65% 
IMG OID641273436 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_001538792 
Protein GI159039539 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000201414 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGTGG GTAAGGCCGG CAAAGGCGGC AAGAAGCGGC CGTCTGTGTG GGCGGGCGTG 
CCACGGTGGG CCCAGGTGTG CACCGTCTTC GGTGCTGTGC TGATGTTCGT CAGCGGGGCG
GCTCTGGTCG GGGCCGAGGC GCTGATGGCC CGGTACGAGG GTGCGGTGGG TAAGGCGGAC
CTGTTCGGGG ACCAGGCGGC AGGCGCCAGT GAGCGCACGA GCGACATCAA GGGACCGCTC
AGCATCCTGC TGGTCGGTGT TGATCCCCGG AAGCCGGAAC AGCCGCCGTT GGCCGACTCG
ATCATGGTGC TGCACGTGCC GGAGGGCCTC GACCGGGCGT ACCTCTTCTC AATGCCCCGT
GATCTCTACG TTGACATTCC CGCCTTCGAG AAGGCCGGGT TCCCTGGCGG CCAGGACAAG
CTCAACGCCG CGATGGCCTA CGGCAGCCGT CAGCAGGGGG AGAACCCGAG CTCGGCGCAG
GGTTTCGAGC TGCTCGCGAA GACGGTGCAG TCGTTGACCG GCATCAAGCG GTTCGACGCC
GGCGCGATCA TCAATTTCGG TGGGTTCATC AAGATCGTGG ACGCGATGGG CGGTGTCACG
ATGGACATCG AGCGCGAGGT GCGCTCAGAG CATCGTCGTC CCGACGGCAC CCATCGTGAG
CTGCGCCCCG GCGGCGGGGG ATACCTTGGT GAGCAGGCGG TCTACCCGGA AGGTGAACAG
CTCCTCGAGG GTTGGCAGGC GCTGGACTAT GTCCGTCAGC GCTACCCGGC GAACGGCGTG
CCGGATGGCG ACTACGGTCG CCAGCGCCAC CAGCAGCAGT TCGTCAAGGC AATGGCGAGT
CAGGCGTTGA GCGCCGACGT GGTGACCAAT CCGATCAAGC TCGACCGGGT ACTCCGGGCC
GCTGGCGAGT CACTGGTGTT CAACGGCCGG GGGCACAGTG TGATTGACTT TGGTATCGCC
CTCAAGGACC TCCGACCGGG CAACATCCAG ATGATTAAGT TGCCGGGTGG CGGGATCACG
GCTAATGGCA AGTACCAGGG CGAGCGTTTC GAGCCGGCCG TACAGGACTT CTTCCGGGCG
TTGAGAGACG AGCAGCTCGA CGCCTTCCTG CTGGAGCACC CGGACTTTCA GAACAAGGGC
TAA
 
Protein sequence
MTVGKAGKGG KKRPSVWAGV PRWAQVCTVF GAVLMFVSGA ALVGAEALMA RYEGAVGKAD 
LFGDQAAGAS ERTSDIKGPL SILLVGVDPR KPEQPPLADS IMVLHVPEGL DRAYLFSMPR
DLYVDIPAFE KAGFPGGQDK LNAAMAYGSR QQGENPSSAQ GFELLAKTVQ SLTGIKRFDA
GAIINFGGFI KIVDAMGGVT MDIEREVRSE HRRPDGTHRE LRPGGGGYLG EQAVYPEGEQ
LLEGWQALDY VRQRYPANGV PDGDYGRQRH QQQFVKAMAS QALSADVVTN PIKLDRVLRA
AGESLVFNGR GHSVIDFGIA LKDLRPGNIQ MIKLPGGGIT ANGKYQGERF EPAVQDFFRA
LRDEQLDAFL LEHPDFQNKG