Gene Franean1_2245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2245 
Symbol 
ID5670644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2683309 
End bp2684508 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content72% 
IMG OID641241165 
Producthypothetical protein 
Protein accessionYP_001506586 
Protein GI158314078 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.350034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0195881 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCGGA CGAACCGGGA CAGTCGTGCG GAACGGGAAG CTCCAACGGA CCCGGACAGT 
CAGGCGGCCC GCGTGGATGA AGATGAAGCG GACCGTGCCG GCCTGTCGGA GGAGAACGAC
CTGACGATTC CGAGTGACGG GCCGGCTCGG AGTGGCGGGG CCGCTACCGC CGGTGGGGTG
GCGCGGAGCG ACGGGACGGA TCGGATGGAC CGGGGGGAGC GGGTGGACCG GGAACGTCGC
GCGGAGGCGC CGGCGGGCCT GCCGACGCCT CCGCAAACAG GCCGGTTCCG TAACCCGGCG
ACCGGCCTGC CCAGGCTGTG GGTCGAGGCC GTCGTCCTGG TGGGGCTCTA CTACGTCTAC
ACGGCCACCC GTGGCGTGGC GGGCTCGTCG GTCGGCGCCG CGACCGACAT GGGCTGGGAC
ATCCTCCGCC TGCAGCAGCA CCTGCACATC GACATCGAGC TCAGCCTCAA CCGGTGGCTG
CAGAGCATCC CGCCGCTGGC GGTCGCCTGC TGCTACTACT ACTCGACCCT GCACTTCGTC
GTCACGCCGG CGCTGCTGGT CTGGATGTAC CGCCGCCACC CCGGCCGCTA CATCCGGGCC
CGGTGGGCCC TGGTCTTCAC CACTCTGATC TCGCTGTGCG GCTTCTTCCT GTTCCCCACC
GCCCCGCCCC GGCTCCTGCC CGGCACCTCG TATGTCGACA CGATGTCGCA CTTCGAGGCC
TGGGGCTGGT GGAGCGGCGG CGCCAGCGCC GCTCCGGACG GCCTCGAGGG GCTGGCCAAC
CAGTACGCGG CCATGCCCTC GCTGCACTGC GCGTGGGCGC TGTGGTGCGG CTTCATGCTG
GCCCGTTTCG CCCGCACACC CCTCGTTCGA GTGATCGGCT GTCTCTATCC CGCTGCGACC
GTGTTCGTCG TGATGGCAAC CTCGAACCAC TACATCCTGG ACGCCGTCGC CGGCTGGGCG
GTGCTCGGCG TGAGCACGCT GCTCTCGCTG GCCATCACCG CCCGCGGCCG ACGCCGGCCG
GCCGAGGCTC CGCCGACTCC CGCTCCGGCT GCCGCCGTGG TGCACCGGCC TGCCGCGGTG
CCTCTGCCCG CCGTGGCGAA GAAGGCGACG GTCGACGTGG CGACCGCCGA CAGGGCCGCC
GGGACCAAGG TGGCCGGGCG CCCGGGCCTG GAGCCTGACG TGGGCCAGGC CTCGGGCTGA
 
Protein sequence
MDRTNRDSRA EREAPTDPDS QAARVDEDEA DRAGLSEEND LTIPSDGPAR SGGAATAGGV 
ARSDGTDRMD RGERVDRERR AEAPAGLPTP PQTGRFRNPA TGLPRLWVEA VVLVGLYYVY
TATRGVAGSS VGAATDMGWD ILRLQQHLHI DIELSLNRWL QSIPPLAVAC CYYYSTLHFV
VTPALLVWMY RRHPGRYIRA RWALVFTTLI SLCGFFLFPT APPRLLPGTS YVDTMSHFEA
WGWWSGGASA APDGLEGLAN QYAAMPSLHC AWALWCGFML ARFARTPLVR VIGCLYPAAT
VFVVMATSNH YILDAVAGWA VLGVSTLLSL AITARGRRRP AEAPPTPAPA AAVVHRPAAV
PLPAVAKKAT VDVATADRAA GTKVAGRPGL EPDVGQASG