Gene Franean1_5828 details

Gene Information

Locus tag: Franean1_5828
Symbol:
ID: 5674151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7070102
End bp: 7071709
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 74%
IMG OID: 641244678
Product: hypothetical protein
Protein accession: YP_001510080
Protein GI: 158317572
COG category:
COG ID:
TIGRFAM ID:
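
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 7070102
end_bp = 7071709

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1608 535
```

Under these assumptions the result agrees with the listed gene length of 1608 bp and protein length of 535 aa.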


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.647484
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTCTATC TCGCTGGGCA GATTCTGGTG TTCGTGCTGC TGGCGATGAT CGTCGGTGCG 
GCGCTCGCCT GGGTCTTCCT CGCCGCGCCG GTTCGGCGCC AGCAGGCCGC GCAGGCCCGT
GCGGCCAGGG CGGCCCGGGC GCGGGCCGCT CGGGGCGAGT CGGCAGGAGG GGGCGCCACC
TCGGCCTCCA CCCCCGGTCG ATCCGGCGGC GCGCCCGCGC GTGATCCCGG CGTCGACAGC
GACCGGGGCG CGCGCCACCC CGACCTCGTT GCAGCCGACC CTGTGACCGA GACAAATACC
GGGTTCGGGC AGCCTCTCGA CGACGACACC TGGCCGCCGC TGGCGCCCAG GGTCGAACAA
GCGGATCTCG CCGACCTGAT GGCGCGCCTG GGTGACCAGG AGGAGCGGTG GGGCGCGGAG
AAGGCGAGCC TGACAGCCCG GCTCGTCGCC GCGGAACAGC AGGCCCTCGA GTCCGAGCAG
CGAGTCGCGG CGGCCGAGTA CCAGGTCGCG ATCGCGCAGG CGCGGATCGG TGAGATCGAA
GCGGCGCTGC ACGCGGCACC GGCCGTCGAC GCGGGCACCG CCGGCGTCGC CCAGCCGGAG
GACGCTCCTG TCACCCCGGA CGCAGCCGCA CTCGCCGACC ACGGTGATGT CACAACTGTC
GCCGACCTTG CCCGGGAGAC GGTCAGGCTG CGTGAGCAAC TGGAGGAGGC GGAGTCGCGC
GCGGCCCGGT TCTCCTCCCG GCTCGCCATG GTCCGCACCG ACGCGGAGGC GGCGCAGCGG
CAGGTCGCGA CGATGAGCAC CCGCCTTGAC CGGCACCAGG CCGAGTGGGC GGCAGAGCGG
ATCAAGCTCC TCGCCCGGAT CGCCGAGGCC GAGGAGACTC GGCCGGCGGC ATCGCTCCCC
CCGGTATCCG TTGATGAGCC CGAGGCCATC GCAACGGTCG AGCCCGTTCA GGCCGAGAGC
GAGACGGACC GGGTCGCGGA CGAGCCGAAG GAGGTCTACG CGGTGCATGA GGCGCCGGCC
GAGCAGGAAC TGACCGAGCA GGAGCCGGTC GAGGTCGAGC CGGTCGAGGT GGAGCCGACC
GAGGAGGAGC CGGCATCATC CTCGGTCGAG CTCACCTGGG CCAGGGAGTC GATCGAGGCG
GAGCACCTGA CCTCCACGGG GCCCGGAGCC TTCGAGGCCG GCCCGCTCGT CCATGTCATC
CCGGGGTCCC GTGACGACGA CCTCGAACTG ATCGGCGCGG CGCTGGTCGG CGCCGCGGCC
TCCGCGGGCG TCACCGGGCG CCCGGCGGTG GGCACGCTCG CCGGCGGATG GAACGCGGGT
CCCGAGTCGG CGGGCCGGTT CCACCTCTTC GGGGCGTCGT GGGGAGGCCC GGCCGAGCCC
GTGCTGTCCA CCGACAACCT CAAGGAGATC GTCGGGGTCG GGCCGGTGAC CGAGTCGCGG
CTGCGGGTCC TGGGCATCAC GACGTTCCGC CAGCTGGCCA CGATGGGCGA CACCGACGTC
GACCGGCTGG CGAAGAAGCT GGACGGGTTC GGCGATCGGA TCGTCACCGA CGACTGGGTC
GGTCAGGCGC GTGACCTTCA GGCCCGGCAC CACAGCGGCC TGGCCTGA
 
Protein sequence
MLYLAGQILV FVLLAMIVGA ALAWVFLAAP VRRQQAAQAR AARAARARAA RGESAGGGAT 
SASTPGRSGG APARDPGVDS DRGARHPDLV AADPVTETNT GFGQPLDDDT WPPLAPRVEQ
ADLADLMARL GDQEERWGAE KASLTARLVA AEQQALESEQ RVAAAEYQVA IAQARIGEIE
AALHAAPAVD AGTAGVAQPE DAPVTPDAAA LADHGDVTTV ADLARETVRL REQLEEAESR
AARFSSRLAM VRTDAEAAQR QVATMSTRLD RHQAEWAAER IKLLARIAEA EETRPAASLP
PVSVDEPEAI ATVEPVQAES ETDRVADEPK EVYAVHEAPA EQELTEQEPV EVEPVEVEPT
EEEPASSSVE LTWARESIEA EHLTSTGPGA FEAGPLVHVI PGSRDDDLEL IGAALVGAAA
SAGVTGRPAV GTLAGGWNAG PESAGRFHLF GASWGGPAEP VLSTDNLKEI VGVGPVTESR
LRVLGITTFR QLATMGDTDV DRLAKKLDGF GDRIVTDDWV GQARDLQARH HSGLA
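
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the usual reading of translation table 11 (the bacterial, archaeal and plant plastid code): codons map to amino acids as in the standard code, and an alternative start codon such as GTG is read as methionine at the first position. The function name `translate_cds` and the constants are hypothetical helpers written for this illustration, not part of any database tool.

```python
# Amino acids for all 64 codons, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine regardless of its usual meaning.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGCTCTATC TCGCTGGGCA GATTCTGGTG TTCGTGCTGC TGGCGATGAT CGTCGGTGCG"
print(translate_cds(first_line))  # prints: MLYLAGQILVFVLLAMIVGA
```

The output matches the first 20 residues of the protein sequence, which is consistent with the GTG start codon being translated as M.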