Gene Franean1_6022 details

Gene Information

Locus tag: Franean1_6022
Symbol:
ID: 5674343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7344818
End bp: 7345945
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 67%
IMG OID: 641244870
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_001510272
Protein GI: 158317764
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
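
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function name `check_record_lengths` is illustrative and not part of the source database.

```python
def check_record_lengths(start_bp, end_bp):
    """Derive gene and protein lengths from inclusive 1-based coordinates."""
    # Inclusive coordinates: both the start and the end base are counted.
    gene_length = end_bp - start_bp + 1
    # Each codon is three bases; the final codon is assumed to be a stop codon
    # that contributes no amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values copied from the record above.
gene_length, protein_length = check_record_lengths(7344818, 7345945)
print(gene_length, protein_length)  # 1128 375
```

Both derived values agree with the Gene Length (1128 bp) and Protein Length (375 aa) fields in the record.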


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.336367
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCCGT GCCGGCTCCT CTTCAGGGGC TTCCACGGCG TCATATGGCG GTCGCCGTGT 
TCGGAAGGAA GTTCTCTCAT GCTGATCGCT CAGCGTCCCT CGCTCGCCGA GGACCCGATC
TCCGAGTTCC GGTCGCGCTT CGTGATCGAG CCGCTCGAGC CGGGCTTCGG CTACACCCTC
GGCAACTCGC TGCGCCGCAC CCTGCTGTCC TCCATCCCGG GCGCGGCCGT GACGAGTATC
CGGGTGGACG GCGTCCTCCA CGAGTTCTCC ACCGTTCCCG GGGTCAAGGA GGACGTGACC
GACCTGATCC TGAACCTCAA GGAACTGGTC GTCAGCTCCG ACAACGACGA GCCGACCGTG
ATGTACCTGC GCAAGCAGGG CCCCGGTGAG GTCACCGCGG CCGACATCGC CCCCCCGGCC
GGCGTCGAGG TGCACAACCC CGACCTGCAC CTGGCCACCC TCAACGACAA GGGCAAGCTC
GAGATCGAGC TGACCGTCGA GCGGGGCCGT GGCTATGTCA GCGCCGCCCA GAACAAGCAG
CCGGGCCAGG AGATCGGTCG CATTCCGATC GACTCCATCT ACTCCCCGGT GCTGAAGGTC
ACCTACAAGG TCGAGGCGAC CCGTGTGGAG CAGCGCACGG ACTTCGACCG GCTCATCGTC
GACGTGGAGA CGAAGCAGTC GATCTCCCCA CGGGACGCGA TGGCCAGCGC CGGCAAGACC
CTCGTCGGCC TGTTCGGGCT GGCCCAGGAG CTCAACGCCG AGGCGGAGGG CGTCGACATC
GGCCCGTCCG CGGCGGACGC TGCCCTGGCC GCCGACCTGG CGCTGCCGAT CGAGGAGATG
GACCTGACCG TCCGCTCGTA CAACTGCCTC AAGCGCGAGG GCATCCACAC CATCGGTGAG
CTGGTGTCCC GCAGCGAGGC GGACCTGCTC GACATCCGCA ACTTCGGGCA GAAGTCGATC
GACGAGGTCA AGACCAAGCT GGGTGCCATG GGCCTGCAGC TCAAGGACTC CCCGCCCGGG
TTCGACCCGC GCCAGGCCGT CGACACGTAC GGCACCGACA CGTACAACCC GTCGTTCTCC
GACCCGTCCG ATGACGGTCG CGAGTTCGTC GAGACCGAAC AGTACTGA
 
Protein sequence
MPPCRLLFRG FHGVIWRSPC SEGSSLMLIA QRPSLAEDPI SEFRSRFVIE PLEPGFGYTL 
GNSLRRTLLS SIPGAAVTSI RVDGVLHEFS TVPGVKEDVT DLILNLKELV VSSDNDEPTV
MYLRKQGPGE VTAADIAPPA GVEVHNPDLH LATLNDKGKL EIELTVERGR GYVSAAQNKQ
PGQEIGRIPI DSIYSPVLKV TYKVEATRVE QRTDFDRLIV DVETKQSISP RDAMASAGKT
LVGLFGLAQE LNAEAEGVDI GPSAADAALA ADLALPIEEM DLTVRSYNCL KREGIHTIGE
LVSRSEADLL DIRNFGQKSI DEVKTKLGAM GLQLKDSPPG FDPRQAVDTY GTDTYNPSFS
DPSDDGREFV ETEQY
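
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is shown in coding orientation and uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code. The helper name `translate` is hypothetical, and only the first and last stretches of the sequence are used here as samples.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for 10-base grouping.
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGCCCCCGT GCCGGCTCCT CTTCAGGGGC"))  # MPPCRLLFRG
print(translate("GAGACCGAAC AGTACTGA"))               # ETEQY
```

The two outputs match the first ten and last five residues of the protein sequence listed above.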