Gene Franean1_6364 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6364 
Symbol 
ID5674680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7725346 
End bp7726779 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content72% 
IMG OID641245213 
Producthypothetical protein 
Protein accessionYP_001510608 
Protein GI158318100 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTTGA AACGAGCAGT CCGGAGCGCG GCGCTGGTGG CGGTGGCCGC CGCCACGTTC 
GCCGGCTGCG TACCGGTCAG CCCGGGCGGC GGAGGCGGCT CCGTCCCGAC CGCCTCGCCG
TCCATCCCCC CGACCGCGCC CTCGTCGCCG TCCGGCACTC CGGCAGGCAC TCCGACCGCC
TCGCCGTCGG TCCGGCCGGC CGTGCCGTCG TCGCCGACCG GCACCCCGGT GGGCACCCCG
ACGCCTTCGC GGACCACGAC CCCGTCCGGA CCGACGTCCG GTGACTGCGG GCGGACGTCG
GGAGCCTCAC CGGCGACGAG GATCACCGAG GTCGGCCTTG GGTCGCCGGT GGTCAGCTTC
GCGCCGCAGG GCGACACGGA CCCGTTGCCG ACCGCGATCG CCGCCGCGCC GGGCGGGGGA
TCCTGGCTGG CCTGGCTGGG CACCGACTCC AAGGTGCGCC TCGGCAAACT GGACTGCGGA
GACCAGCTGG TCGGTACTCC CACATCCCTC GACGCGGTCG ACCTGCAGGA CGTCAAGGCC
GATGCGGACG GTGTCGTGGT CCTGCTGACC CGTCCCGGCC CGCAGGGCAG CGGGACGCTG
TGCGGCGGGA CGTCCAGCCC GACCAGGACC ATGTGGATGG TCCGCCTCGA CAACACCGGC
AGACAGCTGT GGGAGCGTCA GGTCACCAAC CTGAGCAGCA GCCGCGGCGG CTACGACCCC
GGGGCTCTGT TCGTCTGGTG GTACAACCAC CACGGCACCC TGGCCTACGA CGGCACCAAC
TACGCCGCCT ACTTCGAGGC GGCGATCACC GTGGCCAACG GCGGCTGCGT CGACATCCAC
GAGGGTGACC GGATGCAGGT GGTCAACGCC GCCACCGGCG CCCTGGTCTC CGGGCACGAC
AGCTTCGACT GGGGCTGCAG CCACGCCTGG GACTCCCACA TCATCTGGGA CGCCCGCACC
GGCCACTTCG CGATGGTCTG CGCCACGGAC AACAACTGCC GCATCGCCCG CCCCGGCACC
GGCCAGACCG TCGTGCCCGG GGTCTGTGAC GGAACCCTGT TCGGCGGCAA CATCGTGCTG
GCCGGCACCC CGGGCTACTG GACCGCGTGG AGCAACGGCA ACCAGGTACG GCTGGAGCAC
TTCTCCACGG GCGCGTCCGA CCGGACGGTC CTCACCGCCG ACCGGACCCA GCACTCGCAC
CTGGTCGGCT ACGGCGCCGG CAGGATGCTC CTGACCTGGA AGTCCGGAAC CTCGACCGCC
GCCCAGGCGT ACGACACCAC CAGTGGCGGC ACCGTGGGCG GCCAGTTCAC GATCGCCGTG
CCGGACCACA CCTACGTCGA GGCCAAGGCC TACCCCGACG GCAGCGTCGC CTTCCCCGCC
GCCGGCACTT CCAACACGTC CATCCGGGTC GTTCGGATAA TGCCGCTCAC CTGA
 
Protein sequence
MSLKRAVRSA ALVAVAAATF AGCVPVSPGG GGGSVPTASP SIPPTAPSSP SGTPAGTPTA 
SPSVRPAVPS SPTGTPVGTP TPSRTTTPSG PTSGDCGRTS GASPATRITE VGLGSPVVSF
APQGDTDPLP TAIAAAPGGG SWLAWLGTDS KVRLGKLDCG DQLVGTPTSL DAVDLQDVKA
DADGVVVLLT RPGPQGSGTL CGGTSSPTRT MWMVRLDNTG RQLWERQVTN LSSSRGGYDP
GALFVWWYNH HGTLAYDGTN YAAYFEAAIT VANGGCVDIH EGDRMQVVNA ATGALVSGHD
SFDWGCSHAW DSHIIWDART GHFAMVCATD NNCRIARPGT GQTVVPGVCD GTLFGGNIVL
AGTPGYWTAW SNGNQVRLEH FSTGASDRTV LTADRTQHSH LVGYGAGRML LTWKSGTSTA
AQAYDTTSGG TVGGQFTIAV PDHTYVEAKA YPDGSVAFPA AGTSNTSIRV VRIMPLT