Gene Franean1_4939 details

Gene Information

Locus tag: Franean1_4939
Symbol:
ID: 5673278
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5929218
End bp: 5930243
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 74%
IMG OID: 641243793
Product: RNA polymerase factor sigma-70
Protein accession: YP_001509209
Protein GI: 158316701
COG category: [K] Transcription
COG ID: [COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
            [TIGR02960] RNA polymerase sigma-70 factor, TIGR02960 family
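
The length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon; the record does not state either convention, but both are consistent with the values listed. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5929218
end_bp = 5930243


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the CDS ends with a single stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1026 341
```

Under these assumptions the computed values match the Gene Length (1026 bp) and Protein Length (341 aa) fields.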


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.553857
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.665559
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGGTGACG TTCCTCCCGG CGGCAGAACC GGCGGGGCTG CTCTGGCCGG GGCGGAGGCG 
GGTGTCGAGA ATCTTCTCGA GCCGTACCGC CGCGAGCTGA CCGGCTATTG CTACCGAATG
CTCGGCTCGC CGTTCGAGGC GGAGGACGCC GTGCAGGACA CGATGATCCG CGCCTGGCGC
GGAATCGACC GGTTCGAAGG CCGGGCCGCG CTGCGGTCGT GGCTGTACCG CATCGCGACG
AACGTGTGCC TGTCGATGCT CGGCGCCAGT CAGCGGCGGG CACGGCCGAT GGACCTCGCC
GGCCCGTCGG CGGCCGACTC CCCGCTGCCC GCGCCGCTGC CGGAGACGGC CTGGATCGTG
CCCGCGCCGG ACGGCCAGGT CTACTCCAGC GCCGCCGACG CGGCCGACCC CGCCGATCTC
GCCGCCCGTC GCGAGACGAT CAGGCTCGCG TTCGTCGTCG CGCTGCAGCA CCTGCCGGCC
CGCCAGCGGG CGGTGCTCAT CCTGCGCGAG GTGTTCGGCT GGCCGGCCGC CGAGGTCGCC
GACCTGCTGG AGACCTCCGT CGCCTCGGTC AACAGCGCCC TGCAGCGGGC GCGCGCCACG
ATCGCCGCGA CGGAGATCTC CGACGCCGAC CCGCTGCGCC CCGCCGACGC CGAGCAGCGG
GAGCTGCTCG CCCGCTATGT CGACGCGTTC GAGCGGTACG ACCTCGAGTC CCTCGCGGCG
CTGCTGCACG AGGACGTCAC GATGTCGATG CCGCCGCTGG GCCTGTGGCT GCGGGGCCAC
GCCGACGTCC GGGCATGGAT GCTCGGTACG GGCCAGGGCT GCCGGGGCTC GCGGCTGCTG
CCAACCGTGG CCAACGGCCA CCCGGCCTTC GGGCAGTACC GGCCCAGCGC CACGGGTTCC
GGGCACGACC CGTGGGGCCT GGTCGTCCTG GAGATCTCGG CCGGGCGGGT CGCCGGCATC
AACACGTTCC TGGACGTCGA ACGCCTCTTC CCGCTGTTCG GCCTGCCGGC CCGGTTGCCT
GGCTAG
 
Protein sequence
MGDVPPGGRT GGAALAGAEA GVENLLEPYR RELTGYCYRM LGSPFEAEDA VQDTMIRAWR 
GIDRFEGRAA LRSWLYRIAT NVCLSMLGAS QRRARPMDLA GPSAADSPLP APLPETAWIV
PAPDGQVYSS AADAADPADL AARRETIRLA FVVALQHLPA RQRAVLILRE VFGWPAAEVA
DLLETSVASV NSALQRARAT IAATEISDAD PLRPADAEQR ELLARYVDAF ERYDLESLAA
LLHEDVTMSM PPLGLWLRGH ADVRAWMLGT GQGCRGSRLL PTVANGHPAF GQYRPSATGS
GHDPWGLVVL EISAGRVAGI NTFLDVERLF PLFGLPARLP G
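
The gene and protein sequences above can be related to each other by translating codon by codon. The following is a minimal sketch, not the method used to produce this record: the helpers `clean`, `gc_content`, and `translate` are hypothetical names, and the codon table is the standard one, whose sense-codon and stop-codon assignments are shared by translation table 11 (table 11 differs in the start codons it permits, which this sketch does not model).

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_30 = "ATGGGTGACG TTCCTCCCGG CGGCAGAACC"
print(translate(first_30))  # MGDVPPGGRT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content field.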