Gene Franean1_4831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4831 
Symbol 
ID5673172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5769777 
End bp5771063 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content75% 
IMG OID641243687 
Productextracellular HAF 
Protein accessionYP_001509103 
Protein GI158316595 
COG category[S] Function unknown 
COG ID[COG5563] Predicted integral membrane proteins containing uncharacterized repeats 
TIGRFAM ID[TIGR02913] probable extracellular repeat, HAF family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0943414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0197145 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGGCGT ATCCGTTCAG GGCTGAACCA CCGGGGCGGC GGCCCCGCCG GCGGGCGCGG 
CTCGTCCGCG CCACCGCGGC GGCCGTCGCG GCCGGTCTGC TCGGCAGCGC CGTGACCGCC
TCGCCGGCCC TCGCGGCCGC ACCGCCCGAG ACGGCCGCAC CGCCCGAGAC TGCGGCGGCC
TCCGAGACAG CCGCGGCGGC CGAGACGGCC GGTTCCCCGC GGGTGACCGT CACCGACGTG
GGAGCGCCGG ACCGGCTGGC CACGAACCTG TTCCCCGAGG ACCTCAACGA CGCAGGCGTC
GTCACGGGCT ACGGCCTGCT CGGCGGCCCG CAGCCCTTCC AGGCGTTCAC CTGGGCCGAC
GGCACGCTCA CCCTGCTGAA CGCGCCGAGC GCCGACCCGG GGGCGTTCAG CTTCCCGGTC
GCGCTGAACA ACCACGGCCA GGTCGTCGGC TTCACCACCG TCGGCGGGAC GGCGCACTCG
CTGCGCTGGG ACGGCGCGGA CCCGACCGAC ATCTCCGCCG CCGGCGGGAA CAGCCACCCG
CTCGCCGTCA ACGACGCCGG CCAGGTTCTG CTGACCGAGG GCGGCGCCGC GGCGCTGTGG
ACCGCCGGCA GCCGGGTCGC GGTCGCGCCG TTCCCGGTCA CGAACGCCGT GGGCCTCAAC
GGTTCGGGGC AGGTGTTCGG CACCGGCCGG GCCGCGGGCG CCGACGCCAC CGACCGCGCC
TTCGTCTGGA CGCCGACCGC GACCACCGAC ATCGGTCCGT TCGGGCTCAC CACCACCACG
ACCGACCTCA ACGACAGCGG CCAGCTGATC GGGTACGGCG CCTTGGCGCA GAGCCCCAAC
CGCACCCATT CCTTCGTCTG GACCCCCGCG CGGGCCGGCC GGCCGGCTGC CCTCACCGAC
CTGGGCACCC TGGCCAACCT GGAGACCGAG GCGCGCGACA TCAGCAACTC CGGCCACATC
GTCGGGCGCA GCGGCACCCG CTCCGGCTGG CACGCCGTGC GCTGGCAGGG CGGCCGTCTC
GTCGACCTCG GTGTCCTGCC CGGCGGGACG TCCAGCGAGG CGCTCGCCGT CAACGAGACC
GGACAGGCTG CCGGCTGGGG CATCGCTGGC GACGGGCGGC CACATGCCAT CCTCTGGAAA
AGGGACCGCC CGATCGACCT CGGGGTGCCC GCCGGTTTCA CCCAGAGCTT CGCAATAGAC
ATCAACGCCG CTGGCAGGGT GCTCGGCTAC GCGATCGACG AGACGGGCGG CGTCCACAGC
TTCGTCTGGA CGGTGACCGG CGGATGA
 
Protein sequence
MWAYPFRAEP PGRRPRRRAR LVRATAAAVA AGLLGSAVTA SPALAAAPPE TAAPPETAAA 
SETAAAAETA GSPRVTVTDV GAPDRLATNL FPEDLNDAGV VTGYGLLGGP QPFQAFTWAD
GTLTLLNAPS ADPGAFSFPV ALNNHGQVVG FTTVGGTAHS LRWDGADPTD ISAAGGNSHP
LAVNDAGQVL LTEGGAAALW TAGSRVAVAP FPVTNAVGLN GSGQVFGTGR AAGADATDRA
FVWTPTATTD IGPFGLTTTT TDLNDSGQLI GYGALAQSPN RTHSFVWTPA RAGRPAALTD
LGTLANLETE ARDISNSGHI VGRSGTRSGW HAVRWQGGRL VDLGVLPGGT SSEALAVNET
GQAAGWGIAG DGRPHAILWK RDRPIDLGVP AGFTQSFAID INAAGRVLGY AIDETGGVHS
FVWTVTGG