Gene Franean1_2802 details

Gene Information

Locus tag: Franean1_2802
Symbol:
ID: 5671191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3316962
End bp: 3318575
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 68%
IMG OID: 641241711
Product: hypothetical protein
Protein accession: YP_001507131
Protein GI: 158314623
COG category:
COG ID:
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
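
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of a sequence.

    Spaces and line breaks (as in the 10-base blocks of the record) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record: Start bp 3316962, End bp 3318575.
length = gene_length_bp(3316962, 3318575)
print(length, protein_length_aa(length))
```

Under these assumptions the results agree with the listed Gene Length (1614 bp) and Protein Length (537 aa). Applying gc_fraction to the gene sequence given in the Sequence section would allow the same kind of check against the listed GC content.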


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGC CCAGCCAAAG GGTTGAGCAG GACGAGTTGC GGACTCGGAT GCGCGCGACC 
GGCATGAGCC ACCACGAGAT CGCGATCGAG TTCGCCCGCC GCTACCGGCT ACGCCCCCGC
GCTGCCTACC GTGTCGCCCA CGGCTGGACA CAGCAGCAGG CCGCCGACCG CATCAACGCC
CACGCCGTCC GTGCCGGCCT CGACCCGGAC GGCACCGCCC CGATGACCGC GCCGCGGCTA
TCGGAGGTGG AGAACTGGCC TCGCCCTGCC CGGCGACGTC CCACCCCGCA GATCCTCGCC
CTGCTTGCCG AGGTGTACGG ATGCGATCTC CACGCCCTCG TCGACGTGGA CGATCGCGAA
CACCTCCCTC CGGCAGACGT GTTCCTGATC AACGGCATGC GCCGGCTGCC GGACGGCGTG
GCTGCGTCAT CGACGGCCTC TCCGATCACA CTCGCGGGGA AACGATGGGG AACGACCACC
GAACGACCGG TTGACGTCGC CTTCGGCGCC GCCAGAGCTG GGCAAGCACC GGCCCCCAAC
TCGGTCGGTC TCGCAGACAA AGACAACGTG ATCATTTTTC CGCAACTCGC CCCGGATGGG
AGGATCGTCC TCATGCCACT CGATCGCCGG GGTTTCCTGA GCGGCCTGGG CCTCACCGCC
GCCAGCAGCG CAGCCCTCAG CCCGCTCGCC ACGATGCCAC CAGGATCGTC CTCCATCGAC
CCACGTGTCG TCGATCACTT CGCACGCCTG CGGGCCGTGC TCGCGGAGAA CGACAACCTC
TTCGGGCCGC GCCAGGTCAT CACCACAGCG CAGGAACAGG CCGGCCTCAT CGCCGCCCAT
CTCCGCCACG GCACGAGTTC GCGTCCACAA CGGCAGACCC TGCTCCACAT CCAAACGCAG
TTCGCTGATC TCCTCGGCTG GCTCCATCAA GACAGCGGCG ATAACGCCAC CGCTGGATAC
TGGCTCGACC GAGCGCTGGA ATGGTCACAC CGAGCGAGCG ACCCCAACGC CACCGTGTTC
ATCCTTGCCC GCAAAAGCCA ACTCGCCGCA GACCGTGGCG ACCCAGCTGA AGCCGTCGAC
GTCGCCGACG CAGCCCTGAC CAGCGCGGAG CCAACCGGTC GTCTCGCCGC CATCGCCGCG
ACCTACAGCG CGCATGGCCA CGCACTACGC GGCGAGAAAA CCACCTGCCT GACGCTCTAC
GACCGCGCCC ACGACATCCT CGACCAGGCC GGACCCGACA CCGACCCCTG GGGCGAGTTC
TTCAGCCCTG CCTACATCGA AGTCCAGCGG GCACACAGCC TCGCCGCACT CGGTGACTAC
CCTGCTGCAG CCACCGGGTT CCGGACCGCG ATCGACGGTC TCCCCTCGGC TTTCCACCGC
GACCGCGGTG TCTACCTCGC CCGCGAAGCC CTCGCACACG CAGGAGCACG CGAACCAGAG
CAAGCCGCGA CACTCGGTCT CAACGCACTC ACGGTCGGCG CCAGCACGCA CTCCGGATGC
ATCATGACCA GTCTGCGGTC CCTGCGTGAC GCCGTCGCCG GATGGCAAAC TGTCTCCCAG
GTACGCGAGT TCCGCCAGGC GATGGACCAG GTCCCCACGG CCATCACCGT CTGA
 
Protein sequence
MNKPSQRVEQ DELRTRMRAT GMSHHEIAIE FARRYRLRPR AAYRVAHGWT QQQAADRINA 
HAVRAGLDPD GTAPMTAPRL SEVENWPRPA RRRPTPQILA LLAEVYGCDL HALVDVDDRE
HLPPADVFLI NGMRRLPDGV AASSTASPIT LAGKRWGTTT ERPVDVAFGA ARAGQAPAPN
SVGLADKDNV IIFPQLAPDG RIVLMPLDRR GFLSGLGLTA ASSAALSPLA TMPPGSSSID
PRVVDHFARL RAVLAENDNL FGPRQVITTA QEQAGLIAAH LRHGTSSRPQ RQTLLHIQTQ
FADLLGWLHQ DSGDNATAGY WLDRALEWSH RASDPNATVF ILARKSQLAA DRGDPAEAVD
VADAALTSAE PTGRLAAIAA TYSAHGHALR GEKTTCLTLY DRAHDILDQA GPDTDPWGEF
FSPAYIEVQR AHSLAALGDY PAAATGFRTA IDGLPSAFHR DRGVYLAREA LAHAGAREPE
QAATLGLNAL TVGASTHSGC IMTSLRSLRD AVAGWQTVSQ VREFRQAMDQ VPTAITV