Gene Francci3_0244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0244 
Symbol 
ID3903652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp283431 
End bp284636 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content74% 
IMG OID637877572 
Productputative sigma factor 
Protein accessionYP_479361 
Protein GI86738961 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTCGAG GTGCGCCCGT TCCTGACCTC ACCGCCCACC ATCACAGAGT GACCTCTCAG 
ATAAACGAGG CCCTGCTCCG GAGCCTCACC CCGAGCGTGC TAGGGATCCT CGTCCGCCGC
GGAGCCGACT TCGCGGCGGC CGAGGACGCC ATGCAGGACG CGCTGGTCGA GGCGGTCCGC
GTCTGGCCGG CCGACCCGCC GCGGGACCCG AAGGGCTGGC TGGTCACCGT GGCCTGGCGC
CGGTTCCTCG ACGCGACCCG GGCGGACGCC GCCCGCCGCC GGCGTGAGGA CCTCGTCAAC
GAGGAGCCGG CGCCCGGGCC CGCGCCCACG GTGGACGACA CGCTCCAGCT CTACTTCCTG
TGCGCCCACC CCTCGCTGAC GCCGTCGTCC GCGGTCGCGC TCACGCTGCG CGCCGTCGGC
GGGCTGACCA CCCGCCAGAT CGCCCAGGCC TACCTGGTGC CCGAGGCGAC CATGGCGCAG
CGCATCAGCC GTGCCAAGCG CACCATCTCC GGCGTGCGGT TCGGCCAGCC CGGCGACGTC
GCCACCGTGC TGCGCGTCCT CTACTTGGTC TTCAACGAGG GCTACTCCGG CGACGTCGAC
CTTGCCGCCG AGTCCATCCG GCTCACCCGG CAGCTCGCGG CCGCGGTCGA CCATCCCGAG
GTGGCGGGGC TGCTCGCCCT CATGCTGCTC CACCACGCCC GGCGCGTCAC CCGGACCGCG
CCCAACGGCA GCCTGGTGCC GCTCGCCGAG CAGGACCGCA GCCGGTGGGA CACTGAGCTG
ATCGCCGAGG GCGTCAAGAT CCTGCAGGCG GCCCTCGCCC GCGACCGGCT GGGCGAGTTC
CAGGCCCAGG CCGCCATCGC GGCACTCCAC GCTGACGCGC CCACCGCCGA GGAGACTGAC
TGGGTCCAGA TCGTCGAGTG GTACGACGAG CTCGCGCGTC TGACCGACAG CCCGGTCGTC
CGGCTCAACC GCGCAGTGGC CGTCGGCGAG GCCGACGGAC CGCGCGCCGG GCTGGCGGCG
CTCGCGGCGC TGAACGACTC ACTGCCCCGC CACGCCGCGG TGGCGGCGTA CCTCCACGAG
CGCGACGGCG ACCTGGCGAC GGCGGCACGG CTGTACGCCG AGGCGGCCCA CAAGGCACCC
AACCTCGCCG AGCGCGATTA CCTGACGCGC CAGGCCGCCC GGCTCAACGC CCGCCGGTGT
CGCTGA
 
Protein sequence
MARGAPVPDL TAHHHRVTSQ INEALLRSLT PSVLGILVRR GADFAAAEDA MQDALVEAVR 
VWPADPPRDP KGWLVTVAWR RFLDATRADA ARRRREDLVN EEPAPGPAPT VDDTLQLYFL
CAHPSLTPSS AVALTLRAVG GLTTRQIAQA YLVPEATMAQ RISRAKRTIS GVRFGQPGDV
ATVLRVLYLV FNEGYSGDVD LAAESIRLTR QLAAAVDHPE VAGLLALMLL HHARRVTRTA
PNGSLVPLAE QDRSRWDTEL IAEGVKILQA ALARDRLGEF QAQAAIAALH ADAPTAEETD
WVQIVEWYDE LARLTDSPVV RLNRAVAVGE ADGPRAGLAA LAALNDSLPR HAAVAAYLHE
RDGDLATAAR LYAEAAHKAP NLAERDYLTR QAARLNARRC R