Gene Franean1_5903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5903 
Symbol 
ID5674224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7171422 
End bp7172948 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content74% 
IMG OID641244751 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_001510153 
Protein GI158317645 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.667474 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGCC CGAGAGACGA GCTTTCAGCC GAAAGATCCG ACCTGACCTG GTCGGATGCG 
GCCGAGCCGC CCGCGCCCGC GACCTGGCGT CCACCGAGCC CACGCGAGCC GCACTTCCGG
GAAGCGCGCG TCGCGGAGCC GAATCTCCCA GAGCCGTGTT CCGCGGAGCC GTACTTCACG
GAGGCGCGCC TCGCGGAGCC GACGCCCGCG ATCGGGTCGC ACGCCGCGCT GCCACCCGAG
CTGAGCCCGC GTCTTGTCCG GCGGCGGCGC TCGCCGCTGC GCCGGCTGTC CGTCACGCTG
GTCGCGATGA TCTCGCTGTG CGTCCTGGGG GCCACGTCGG TCGGGTGGGC GGCCTACCGG
CACTTCGACA GCGCGATCGA CCGCAAGGAC TGGACGCCCG TCGCCGGCGC CCGGCCCGCG
GTGGTCCCGG GCGACCTGAA CGTTCTCCTG CTCGGCAACG ACAGCAGGGA GGGCACCCAC
GGCGAGTTCG GCGACCCGGG CGGCACCCGG GCGGACACGA CGATCGTCGC CCACTTCGAC
GCGGACCGCT CCGTCACGCT GGTGTCCTTC CCGCGCGACA CGCTTGTGCC CGTCGTGCCC
GCGGCCGCCG CCACCGCCCC GGACGGCCTG TCGAAAATCG CGGACGTCAT CCCGCTCGCG
GGCGTGCCCG GCCTGATCTC GACCCTCGAG GCGTTCACCG GGCTCAAGAT CGACCACACC
GTCTCGATCA ACCTCGCCGG CTTCCGCGCG ATGACGGATG CCGTCGGCGG CGTCACGGTA
TGCGTCCTGC CGCTGCCCGA CGGCAGTACC CGCAACCTCC GGGACCGGGA GTCGGGGTGG
CGGGGCCAGC TCGGCGAGAA CCGGCTCAAC GGCGACCAGG CGCTGGCGTT CGTCCGCACC
CGCAAGGCGC TGGGCGAGGA GCGGCTGCGC ATCCTGCGCC AGCAGCAGTT CCTCTCCCGG
CTGCTCGACG CGGCGACGAG CGCCGGTGTG CTCACCAATC CCGCGCGGAT CACCAGTCTG
CTCGGCGCCG TCGGTGGCGC GCTGCAGATC AGTGACAGCC TGACCCAGAC GGAGATGCTC
CGGCTGGCGA AGCGGATCAG CGAGCTCGGC CCGGGCGGCC TGCGGTTCAT CACGATCCCG
ACGTATGTCC CGCTGCCCTC GGACGGCGCC GTCGACGAGA TGGGCACGAT CCCACCGCAC
GGCATGGTCC TCCTGCACGA CCCGGCCGGC CTGGAGGCGA TCGTGGGCCC GATGCGGGCC
ACGGCCGAGA ACGGCGGCGG CTCCGGATCG CCCACAGCGC CGGGCGTCTC CGTGGCCGCC
GGCGCCGCGC CGGGGGCAAC CACGCCGGGG GGAACGGCGC AGGCCTCCCC CTCCGCCAGC
CCAAGCGGCA CAGCGTCCGC CGGCACAGCG TCCGCCGGCA CGACGGCCGA CCGCACCGCG
GTCCCCGGCA CGACGCCGCT CCCCGGCACC GCCGCTGGGA CGGCATCGGC GGTGCCGGTG
CCCGCCGACA CCTCCTGCAC CCCCTGA
 
Protein sequence
MRRPRDELSA ERSDLTWSDA AEPPAPATWR PPSPREPHFR EARVAEPNLP EPCSAEPYFT 
EARLAEPTPA IGSHAALPPE LSPRLVRRRR SPLRRLSVTL VAMISLCVLG ATSVGWAAYR
HFDSAIDRKD WTPVAGARPA VVPGDLNVLL LGNDSREGTH GEFGDPGGTR ADTTIVAHFD
ADRSVTLVSF PRDTLVPVVP AAAATAPDGL SKIADVIPLA GVPGLISTLE AFTGLKIDHT
VSINLAGFRA MTDAVGGVTV CVLPLPDGST RNLRDRESGW RGQLGENRLN GDQALAFVRT
RKALGEERLR ILRQQQFLSR LLDAATSAGV LTNPARITSL LGAVGGALQI SDSLTQTEML
RLAKRISELG PGGLRFITIP TYVPLPSDGA VDEMGTIPPH GMVLLHDPAG LEAIVGPMRA
TAENGGGSGS PTAPGVSVAA GAAPGATTPG GTAQASPSAS PSGTASAGTA SAGTTADRTA
VPGTTPLPGT AAGTASAVPV PADTSCTP