Gene Franean1_6577 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6577 
Symbol 
ID5674892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8001932 
End bp8003470 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content75% 
IMG OID641245428 
ProductCHAD domain-containing protein 
Protein accessionYP_001510820 
Protein GI158318312 
COG category[S] Function unknown 
COG ID[COG5607] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.483956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCTCTT CGCGGGAGAT CGAACGCAAG TACTCCGTCG ACCAGAGCTT CGTACTTCCC 
CGGCTGACGC AGGTGGACGG GGTCGCTACC GCCCGCACCC GCCGCACCGT CACCCTGGAG
GCCGTCTACT ACGACTCCGA CGACCTGCGG CTGGCCCGCA ACCGCATCAC CCTGCGCCGC
CGGACGGGCG GGGACGACGC CGGCTGGCAC CTGAAGCTCC CCGTCGGCCC AGGGGTGCGT
GACGAGATCC ACCGCCCGCT CGGCTCCCAC GATCCGGTTC CCGACGATCT TGTCGACCTC
GCCACCGTGC ACCTGCGCGG GGCGGAGCTG CGCCCGGTCG CCCGCCTGGT GACGCTGCGC
ACCGCCCGCC GCCTGCGCGA CAAGGCGGGC TGTGACCTGG CGGAGGTCGT CGACGACCAG
GTGACGGCCC AGACGCTCGG CGCGACCACG GTCGTGCAGA AGTGGCGGGA GATCGAGGTC
GAGCTGGGCT CCGGTGACCC GGGCGTGCTC GACGCCGTCG AGGCCGTGCT GACCGCCGCC
GGCGCCGAGC TGTCGGCCTC GGCGTCGAAG CTGGCCCAGG TGCTCGGCCC GGCGCTGAGC
GGCGCGCCCG GCCCGGACGT GCCGCGCGCC GCGGCGAAGC TGCGCCGGCG GACCCCGGCC
GGCGAGGTCG TCCGCGCCTA CCTGGTCGCG CAGGTCGAGG CGCTGCTCGC CACCGACCCG
CGGGTGCGCC TGGACGCGCC CGACGCGGTG CACCGGATGC GGGTGGCCTG CCGGCGGACC
CGCAGCACCC TGCGGACCTT CGCGCCCTTC TTCCCCGCGG AGCTGGTCGC GCACCTCGAC
GTCGAGCTGC GCGACCTGGC CGGGGCGCTG TCCGCCGCGC GCGACGCCGA GGTACAGATC
GCCTACTTCG CGGGCCGGCT CGATGACCTG CCCGCCGACC TGCTGCGCGG GGACGTCGTG
GGCGCGATCT CGGCCCGGCT CGCGGCCGAC CAGACGGCTG CCCGGGCCCA GGCGCTGGAG
ATGCTGCGCA GCGAGCGTTA TCTGACCCTC GTGGAGGATC TGCTCGCGCT CGTCAACGGC
CCGTTCGTCG GGCGTTCGGG CAAGCCTGCG GAGAAGGTGC TACCCGGTCT CCTGCACGAC
GCCGACCGGC GGCTGAGCCG CAAGGTCAGC CGGGCCGCCC CCCTGCCGGT GGGCACCGAA
CGTGACGAGC TCCTGCACTC CGCGCGCAAG CAGGCGAAGC GGCTGCGCTA CGCGGCCGAG
GTGGTCGGGC CGGTGTACGG CCCGCCCGCG GCCGCGTTCG CCCGGCTCGC CGAGTCGATG
CAGGAGCTGC TCGGCACCCA CCAGGACGCG ACGATCGCCC GCGGCCTGCT GCGCGACTGG
GGGGTCGAGG CGCAGACAGC CGGCGAGCCG ACCGCTTACA CCCTGGGCGT ACTGCTCGGG
TTGGAGGAGT GCCGCGCCCG CACCGCCGAA CGGGACTTCT TCGACCTGTG GCCGGACGCG
TCGGCGCGCC GGCACCGGCG CTGGTTCGCC CGCCGCTGA
 
Protein sequence
MGSSREIERK YSVDQSFVLP RLTQVDGVAT ARTRRTVTLE AVYYDSDDLR LARNRITLRR 
RTGGDDAGWH LKLPVGPGVR DEIHRPLGSH DPVPDDLVDL ATVHLRGAEL RPVARLVTLR
TARRLRDKAG CDLAEVVDDQ VTAQTLGATT VVQKWREIEV ELGSGDPGVL DAVEAVLTAA
GAELSASASK LAQVLGPALS GAPGPDVPRA AAKLRRRTPA GEVVRAYLVA QVEALLATDP
RVRLDAPDAV HRMRVACRRT RSTLRTFAPF FPAELVAHLD VELRDLAGAL SAARDAEVQI
AYFAGRLDDL PADLLRGDVV GAISARLAAD QTAARAQALE MLRSERYLTL VEDLLALVNG
PFVGRSGKPA EKVLPGLLHD ADRRLSRKVS RAAPLPVGTE RDELLHSARK QAKRLRYAAE
VVGPVYGPPA AAFARLAESM QELLGTHQDA TIARGLLRDW GVEAQTAGEP TAYTLGVLLG
LEECRARTAE RDFFDLWPDA SARRHRRWFA RR