Gene Franean1_2246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2246 
Symbol 
ID5670645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2684842 
End bp2686065 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content79% 
IMG OID641241166 
ProductOmpA/MotB domain-containing protein 
Protein accessionYP_001506587 
Protein GI158314079 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.14079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00792288 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCCTCGC CCGCGGTGGC GTTCCCGCGG CGCCCCCGTC TGGTGGCGTT CGCCGTCTCA 
TTGATCATCG CCTGGCTGCT GCTCGGGCTG GGGGCGCTCT GGCTGCGGCG CGGGCCGATC
GAGAATGACC TGACCGGCCG GGCGGCGGAC GCCGTCCGCG CCGCGGGCGC CACCCAGGTG
CGGGTCAGCG CCGAGGGCCG GGAGATCGTG CTGCACGGCC GCTTCGACAG CGCCGAGGAC
GCCCGCCGGG CGCGGGCGGC GGCCGCCGTC TCGGGCAGCA GCTCCGTCCG GCTGGCCGCG
GACGCCGTCA TCGCCTCGGA GCCGGCGCAG CCCCTCGTGG TCGGGGTGGA GCGCGCCGGC
CGAGCGCCGG GCGGGGTGGT GCTGAGTGCG ACCGTGCCGG ACAGTGCGAC CCGGGCCGCC
CTGCTGGGCG CGGCGGCGGA CGCCGCCGGC GGCGTCGTCT CCGCGACGGT CACGGTCGAC
CCGCGGGTCG CGACGCCGGC CGTGGAGGCG TTCGGTGACG TGGCGAAGGC GTTGGGCACC
GGGCCGGGCG TCCGCTCGGT GACGATCGAC GGCTCGTCGG TCGTGCTGTG GGGGAGCGCC
CCCGACGACG CGGCACGGGC GTCGATCGGG GCGGCGGTCC TGGCCGCGGC CCGGCAGTCG
ATCCCTGGCG CGTCCCTGGA GAACCGGCTT GCCGTCGGCT CGGGCGCCAC CGTGACCCTC
GACCCGGCGA CGGGCGAGAT CGTCCGGCCG GCCGCACCGG CCCCGGCCAC CGCGCCACGG
TCGGCGCCGA CCCCGGCCCC GGCGACCGCG CCGGCACCCG GCACGACCGG CGGCGCCGGG
CGGGCCGCCG CTGGCAGCGC GGACAGCACC CGGGCGGCGC TGCGGGCGGC GCTCGACGGC
ACTGCCCTCA CCTTTCCGGT GGGCGACACC GTCCTCGGCG CCTCGACGCG CTCCGGTCTG
GACAAGGTGG CCCGCGCGTT GCTGCCCGGC GACCTGACGG TGATCGTCGG CGGGCACACC
GACAGCACCG GTCCGCGGGC CCTCAACCAG GCGCTCTCGA TCGACCGGGC CCGCGTCGCG
CGCGAGTACC TGGTGATGCG GGGGGTTCCC GCGGAGCGGA TCCGTGCGGC CGGCTTCGGC
CCGGACCAGC CGATCGCGGA CAACGCGAGC ACGTCCGGCC GCGCGGCGAA CCGCCGGGTC
GACGTGACTC CGGTCGCCGA CTGA
 
Protein sequence
MSSPAVAFPR RPRLVAFAVS LIIAWLLLGL GALWLRRGPI ENDLTGRAAD AVRAAGATQV 
RVSAEGREIV LHGRFDSAED ARRARAAAAV SGSSSVRLAA DAVIASEPAQ PLVVGVERAG
RAPGGVVLSA TVPDSATRAA LLGAAADAAG GVVSATVTVD PRVATPAVEA FGDVAKALGT
GPGVRSVTID GSSVVLWGSA PDDAARASIG AAVLAAARQS IPGASLENRL AVGSGATVTL
DPATGEIVRP AAPAPATAPR SAPTPAPATA PAPGTTGGAG RAAAGSADST RAALRAALDG
TALTFPVGDT VLGASTRSGL DKVARALLPG DLTVIVGGHT DSTGPRALNQ ALSIDRARVA
REYLVMRGVP AERIRAAGFG PDQPIADNAS TSGRAANRRV DVTPVAD