Gene Franean1_5269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5269 
Symbol 
ID5673603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6337803 
End bp6338834 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content68% 
IMG OID641244124 
Productrod shape-determining protein MreB 
Protein accessionYP_001509533 
Protein GI158317025 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG1077] Actin-like ATPase involved in cell morphogenesis 
TIGRFAM ID[TIGR00904] cell shape determining protein, MreB/Mrl family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.558763 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.287821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGTT CGCTGTCATT CCTCGGTCGT GACATGGCCG TCGACCTTGG TACCGCCAAC 
ACGCTCGTCT ACGTCCGCGG CAGGGGGATC GTCCTCAACG AGCCCAGTGT GGTGGCGATC
AATACGACCA CCTCGGGCAT CCTCGCCGTC GGCACCGACG CCAAGCGGAT GATCGGGCGC
ACCCCGGGCA ACGTGGTGGC CGTCCGCCCG CTGAAGGACG GCGTGATCGC CGACTTCGAG
ACCACCGAGC GGATGCTGCG CTACTTCATC CAGAAGGTGC ACCGCCGCCG GCACTTCGCC
AAGCCGCGGC TGGTGGTGTG CGTGCCGTCC GGTATCACCG GGGTGGAGCA GCGGGCCGTC
AAGGACGCCG GCTACCAGGC CGGGGCCCGC AAGGTCTACA TCATCGAGGA ACCGATGGCG
GCGGCGATCG GTGCCGGCCT GCCCGTCCAC GAGCCGACCG GGAACATGGT CGTCGACATC
GGCGGCGGCA CGACCGAGGT GGCGGTCATC TCCCTCGGTG GGATCGTCAC CAGCCAGTCG
ATCCGCACGG CCGGTGACGA GCTCGATACG GCGATCATCT CCTACGTCAA GAAGGAGTAC
TCGCTGATGC TCGGCGAGCG GACCGCCGAG GAGATCAAGA TGGCCATCGG CTCGGCGCAC
AAGATCCCGG ACGAGCCGAG CGCGGAGATC CGCGGCCGTG ACCTCGTCAC GGGCCTGCCC
AAGACGATCG TGGTGACCGC CGAAGAGATC CGCAAGGCCA TCGAGGAGCC GGTGAACGCG
GTGATCGACG CCGTGAAGGT CACCCTCGAC AGGTGCCCTC CGGAGCTCTC CGGCGACATC
ATGGACCGCG GGATCGTGCT GACCGGTGGT GGGGCGCTGC TGCGCGGGCT CGACGAGCGC
CTGCGCCACG AGACCGGGAT GCCGATCCAC ATCGCGGAGA ACCCGCTGCA CTCGGTGGCG
ATGGGCTCGG GCAAGTGCGT CGAGGAGTTC GAGGCGCTCC AGCAGGTCCT GATCTCCGAA
CCGAAACGCT GA
 
Protein sequence
MSSSLSFLGR DMAVDLGTAN TLVYVRGRGI VLNEPSVVAI NTTTSGILAV GTDAKRMIGR 
TPGNVVAVRP LKDGVIADFE TTERMLRYFI QKVHRRRHFA KPRLVVCVPS GITGVEQRAV
KDAGYQAGAR KVYIIEEPMA AAIGAGLPVH EPTGNMVVDI GGGTTEVAVI SLGGIVTSQS
IRTAGDELDT AIISYVKKEY SLMLGERTAE EIKMAIGSAH KIPDEPSAEI RGRDLVTGLP
KTIVVTAEEI RKAIEEPVNA VIDAVKVTLD RCPPELSGDI MDRGIVLTGG GALLRGLDER
LRHETGMPIH IAENPLHSVA MGSGKCVEEF EALQQVLISE PKR