Gene Franean1_3347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3347 
Symbol 
ID5671718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3959043 
End bp3960662 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content66% 
IMG OID641242235 
Productband 7 protein 
Protein accessionYP_001507655 
Protein GI158315147 
COG category[S] Function unknown 
COG ID[COG2268] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.953991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCTGGT ACGGGTGGCT GCTGCTGGTC ATTTTTCTCG GCGGTGGGGG GTATCTTCTC 
TTCATCGGGC TGATAAAGGT ACCGCCGGGC CAGGTGGGAG TCGTTCGGGT ACGGTTTGCC
CCGAGCCACC CGGACGACGC ACGCCGGCGG GTGAAGGTGC ATGGCTCGCC AGGGGTGCAG
GCCGAGCTAC TGAAGGCCGA CTGGCTCTAC TTCCGGCCAC CGGTGCTGTA CCAGATCAGC
TATCGGGAAA GGACCAGGGT CCCGCCGGGA ACGATCGGCG TGGTCGTGGC GAGGGACGGG
GAGACGGCTC CGGTGCACGA GGCGCTTTCG AGGCACGTCG AATGCGACTC CTTTCAGGAC
GGGCGCGCCT TCCTCTGCAA CGGGGGCCAG AAGGGAAGAC AGCCAGCGGT GTTGCCGGTC
GGCGAGTACG ACATCAACCC GGAGCTCTTC GACGTCGTGA CGGTGGAGAC GATCGGGTCT
GGCCGGTACG GCCTCACCGA GAACGACCTG AGAGAGATCG ACGTTCAGGT CGGAACCGCC
GGAGTCGTGA TCGTCCGCGT CGGACAGATG TACGACGACA GCAACGCCGC GGTCGCGCCG
CGGGTACCCG GCCATCGCAG TTTCCAGTAC CCCTGGGCTT TCCTCGAGAA CGGTGGTTGC
CAGGGGGTAC AGGAGGAGAC GCTCCCCGGC GGCGGGAGCT ATCAGATCAA TCCGTGGTTC
GCCCGCGTGG TCCTCATCCC CACCCGGGTC CTCGTCCTGG AGTGGAGCAG GAAGCGGCAG
GAGGAGAGCC GCTTCGACTC GGCGCTCGAC CAGATCGTGG TCAACGTCGG AGGATTCCCG
ATCCGGTTCG ACATGCAGCA GACCATCCAG ATTCCGGCCA GAGCGGCGCC GCGGCTGGTG
AGCCAGTTCG GCGAGCAGGA GAAACATCCG GTCGAGCCGG ACGACGACGT GACGAGACCC
GCCCCGGTGC GCCGGTTCGT CGAGCGTGTG CTGGGGACGA CGGCCGAGGG ATATTTTCTG
AGTACGGCCA GCGAGTACAA GGTGCTCGAC TTCCTGAACA GTCACAACGA GGTTCGGCTG
GAGGTCGAGC AGAAAGTCCG CCAGGCGCTT GACGAATGGG ATATCGAGGC GGTCCGCACC
ACCCTCGGCG AGTTCGAACC ACCCGCGAAC CTTGACGAGA TCCGCCGGGC GATCGCCAGC
GAGCGGGAGC ACGCACGTAT CCACCGGCAC GAGCTGGAGA ACGCGAGAAT CAAGGCGGAG
ATCGTCCGGG TCCAGGCGGA GAGCGAAGCG GTCGCCAAAG GAATCCGGAG CACAGCGGAG
GCGGAGCACA TCCAGAAGCT GGCGGCGGCG GAGCTCGAGG CGAGGATCCA GCTTCTGGGC
CAGGATGTCG TGGCGATGGA GCTCCTCCTG GCACAGCTCT CGAAAATGAA CGTGCCGACC
TATGTCGGCG GCGACGCGAC CGCGCTGCTA CAGCACATGC CCCTGGAAGC CGCACGCGAA
ATGATCAACA ATGCGATCGC CCGCGTCCAG CAGAAGGAGG TGGCTGGCGC GCCGCCGCGT
CCCGGCCCAC CCGACGACGG CGCCGCCAGC GAACTGACAG CCGGTACCGG TCCGATCTGA
 
Protein sequence
MPWYGWLLLV IFLGGGGYLL FIGLIKVPPG QVGVVRVRFA PSHPDDARRR VKVHGSPGVQ 
AELLKADWLY FRPPVLYQIS YRERTRVPPG TIGVVVARDG ETAPVHEALS RHVECDSFQD
GRAFLCNGGQ KGRQPAVLPV GEYDINPELF DVVTVETIGS GRYGLTENDL REIDVQVGTA
GVVIVRVGQM YDDSNAAVAP RVPGHRSFQY PWAFLENGGC QGVQEETLPG GGSYQINPWF
ARVVLIPTRV LVLEWSRKRQ EESRFDSALD QIVVNVGGFP IRFDMQQTIQ IPARAAPRLV
SQFGEQEKHP VEPDDDVTRP APVRRFVERV LGTTAEGYFL STASEYKVLD FLNSHNEVRL
EVEQKVRQAL DEWDIEAVRT TLGEFEPPAN LDEIRRAIAS EREHARIHRH ELENARIKAE
IVRVQAESEA VAKGIRSTAE AEHIQKLAAA ELEARIQLLG QDVVAMELLL AQLSKMNVPT
YVGGDATALL QHMPLEAARE MINNAIARVQ QKEVAGAPPR PGPPDDGAAS ELTAGTGPI