Gene Franean1_2098 details

Gene Information

Locus tag: Franean1_2098
Symbol:
ID: 5670498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2522073
End bp: 2523653
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 72%
IMG OID: 641241019
Product: hypothetical protein
Protein accession: YP_001506440
Protein GI: 158313932
COG category: [S] Function unknown
COG ID: [COG2308] Uncharacterized conserved protein
TIGRFAM ID:
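
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon; the function names are illustrative and are not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length in aa, assuming the CDS includes one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2522073, 2523653)
print(length)                     # 1581
print(protein_length_aa(length))  # 526
```

Both values agree with the Gene Length and Protein Length fields of this record.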


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.655673
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAGGTCG CGGCAGCGGC CGCCTGGGAC GAGGTCTTCG ACGCGGCGCA TCGCCCCCGG 
GAGGTGTACA CCGCCCTGCA CGACGCGCTG CAGCCGCTGA GCAGCTCCGA CCTCGCGGCC
CGCAAGATCG CGCTCGACCG TGCCTTCCGG GACGCCGGCA TCACCTTCAA CCTGTTCGGC
GAGGAGCGGC CGTTCCCGCT GGACCTGGTG CCCAGGCTGC TCTCCTGCGA CGAGTGGGAC
GTCATCGAGC GGGGCGTGAC CCAGCGGGTG CGCGCGCTCG AGGCGTTTCT CGACGACGTC
TACGGGCGTG CCGACGTCCT CGCGGACGGC ATCGTGCCCC GCCGGCTGGT GCTGTCCAGC
TCGCACTTCC ACCGTGCGGC GCACGGCATC GACCCGCCGA ACGGCGTCCG CGCGCATGTC
AGCGGCATCG ACCTGGTGCG GGACGAGCGC GGCGACTTCC GGGTGCTCGA GGACAACGTC
CGCGTCCCGT CCGGGGTCAG CTACGTCATC GAGAACCGGC GCGCGATGAC CCGGGTGTTC
CCGGAGCTGT TCTCCACCCA CCGGGTGCGC CCGGTCGCCG ACTACGCCAC CCACCTGCTG
CACGCGCTGC GCGCGGCGGC GCCACCGGAG GTCGCCGACC CGACCGTCGT GGTGCTCACC
CCGGGCGTGT ACAACTCCGC CTACTTCGAG CATGCGCTGC TGGCCCGCCA GATGGGCGTG
GAGCTGGTCG AGGGCCGGGA TCTCTCCGTC CGGAACAACC GGGTCACGAT GCGCACCACC
GAGGGTGACC AGCCGGTGCA CGTTGTCTAC CGCCGGGTCG ACGACGACTG GCTCGACCCG
CTGCACTTCC GTCCCGAGTC GATGGTCGGC TGCGCGGGGC TGCTCAACGC GGCCCGGGCT
GGGAACGTGA CGATCGCGAA CGCGGTCGGC AACGGGGTCG CCGACGACAA GCTGATGTAC
ACCTACGTCC CGGACCTCAT CCGTTACTAC CTCGGTGAGG AGCCGGCGCT CGGCAACGTC
GACACCTTCC GGCTGGAGGA CCCGGACCAG CGCGCCCATG TGCTGGACAA CCTCGAGTCC
CTGGTGGTCA AGCCGGTGGA CGGCTCCGGC GGCAAGGGGA TCGTGATCGG CCCGCAGGCG
ACCGAGGCCG AGCTGGTCGC GCTGCGCGCG CGGGTGCTCG CCGACCCGCG CGGGTGGATC
GCACAGCGGG TGGTGAAGCT GTCGACCTCC CCGACCCTGG CCGATGACCG CCTCGGGCCG
CGCCACGTCG ACCTGCGGCC GTTCGCGGTG AACGACGGGA ACCGGATCTG GGTGCTGCCC
GGCGGGCTGA CCCGGGTCGC GCTGCCCCGC GGCAGCCTGG TCGTGAACTC CAGCCAGGGC
GGCGGTTCGA AGGACACCTG GGTGCTCGCC CCCGAGCGGG TCGGTCCGGA GGGGGCCGCG
CTTCTGCGTC GGCGGCCGGG CCTGACGCCG TCGGTCGCCG CCGGGCCCGA CCTCGGCCCG
CACTCGTCCG ACGAGCAGCA GCAACAGCAG TCCGAGCAGC AGAACCAGCA GCAGAACCAG
CAGGGGGACG GGCTGTGTTG A
 
Protein sequence
MEVAAAAAWD EVFDAAHRPR EVYTALHDAL QPLSSSDLAA RKIALDRAFR DAGITFNLFG 
EERPFPLDLV PRLLSCDEWD VIERGVTQRV RALEAFLDDV YGRADVLADG IVPRRLVLSS
SHFHRAAHGI DPPNGVRAHV SGIDLVRDER GDFRVLEDNV RVPSGVSYVI ENRRAMTRVF
PELFSTHRVR PVADYATHLL HALRAAAPPE VADPTVVVLT PGVYNSAYFE HALLARQMGV
ELVEGRDLSV RNNRVTMRTT EGDQPVHVVY RRVDDDWLDP LHFRPESMVG CAGLLNAARA
GNVTIANAVG NGVADDKLMY TYVPDLIRYY LGEEPALGNV DTFRLEDPDQ RAHVLDNLES
LVVKPVDGSG GKGIVIGPQA TEAELVALRA RVLADPRGWI AQRVVKLSTS PTLADDRLGP
RHVDLRPFAV NDGNRIWVLP GGLTRVALPR GSLVVNSSQG GGSKDTWVLA PERVGPEGAA
LLRRRPGLTP SVAAGPDLGP HSSDEQQQQQ SEQQNQQQNQ QGDGLC
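
The protein sequence begins with M although the gene sequence begins with GTG. Under translation table 11, GTG can act as a start codon and is read as methionine in that position. The sketch below shows one way to check a gene/protein pair like this one and to compute GC content. The codon table is the standard amino acid assignment, which table 11 shares; the set of start codons is a commonly used subset and is an assumption here, as are the function names.

```python
BASES = "TCAG"
# Standard amino acid assignments in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of the start codons allowed in bacterial translation.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and halting at a stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence as displayed in this record.
print(translate("GTGGAGGTCG CGGCAGCGGC CGCCTGGGAC"))  # MEVAAAAAWD
```

Passing the complete gene sequence to `translate` and comparing the result with the protein sequence (after the same whitespace cleanup) gives a quick consistency check for the record.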