Gene Franean1_5843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5843 
Symbol 
ID5674166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7088595 
End bp7089779 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content74% 
IMG OID641244693 
Productputative integral membrane protein 
Protein accessionYP_001510095 
Protein GI158317587 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00448187 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.436736 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACG GAGCCGGTCA GACCCCTCCC GGCACGCCCG GACCGGAACA GCCCGGCTCG 
TGGGGTGCGC CCTCCCCGGG CCCGCAGCAC CCACCCCCCG GTTCGTGGGA TCAGCCACCA
CCCGGCCCCG GAAGCGGCGC GCCCACGCCC GGCGAATGGC CTGCGCCCGG CGGCTGGCAG
GCCCCCGGCA GCGACCCCTC CGCGGCGCCG GTCCCCGGAC AGGGCGCTCC CCCCGGCTAC
GTCCCGAACA GCCCGAACAG CGGGTACGGG CCGGGCGGGT ACGGGCCGCA CGGTGGCTAC
GGCCAGCAAC CCGGCCCGGC CCGGGGCTAC GGCGCGTGGG GCCCGAACGG GTGGGGCGGT
CCGCCGATCG CACCCAAGCC CGGCGTGATC CCGCTGCGCC CGCTGGCGGT CGGCGAGATC
CTCGACGGGA CCTTCGCGAC GATCCGGTCG AACCCCGGCG CCACCCTCGG CCTTACCCTC
GGCGCCACCG CCGTGGTCGA GACGATCAGC ACGGTCGCCG CGATCGCCGC CGAGGAGATG
TCCAATACGG CGGCAACGGT ACTGACGCTG CTGCTCTTCG GCCTGAACGC GGCGCTCGGG
ATCTTCCTCT CCGGGGTGCT CGCGGTGGTG GTGAGCGAGG CGACGCTCGG CGGGCGGATC
ACCGCGGGCG ACGCCGTCCG CCGGGTCACC CCCCGGCTGG GCGGTCTGCT GATGCTGACC
CTGGCGGTCA CGCTGTTGAG CGCGCTCGGC CTGGTCGCGC TGATCGTGGG GGCGGTCGTG
GTCGCCGTCT ACCTGAGTCT GGCCACACCG GCCTACGTGC TCGAGGCGCA GTCGACCGGC
GACGCGCTTC GGCGCTCGTG GCGGCTGGTC AAGGGATCGT GGTGGCGGAC GCTCGGCGTT
CTCCTCCTCT CCGCGGCGGT CGGCGGGGTC CTGATGTTGA TCTTCGCGAT CCCGACCAGC
GTGATCCTCA TGTCGTCCGA GCAGACGTTC GGCAGCCTGG TCGAGGGAGA CCTGACCGTG
GCCGGGCACA TCGTCAACGC GATCGGCAGC CTGCTCGCCA CGACGGTCGC GACTCCGGTG
CTCTCCGGGG CCGTCGTCCT CCTCTACATC GATCGGCGCA TCCGCCGCGA GGGGCTGGAC
GTCACCCTCA CCGAGGCGGC CCGCCAACGC GCGGCCACTC CGTGA
 
Protein sequence
MTDGAGQTPP GTPGPEQPGS WGAPSPGPQH PPPGSWDQPP PGPGSGAPTP GEWPAPGGWQ 
APGSDPSAAP VPGQGAPPGY VPNSPNSGYG PGGYGPHGGY GQQPGPARGY GAWGPNGWGG
PPIAPKPGVI PLRPLAVGEI LDGTFATIRS NPGATLGLTL GATAVVETIS TVAAIAAEEM
SNTAATVLTL LLFGLNAALG IFLSGVLAVV VSEATLGGRI TAGDAVRRVT PRLGGLLMLT
LAVTLLSALG LVALIVGAVV VAVYLSLATP AYVLEAQSTG DALRRSWRLV KGSWWRTLGV
LLLSAAVGGV LMLIFAIPTS VILMSSEQTF GSLVEGDLTV AGHIVNAIGS LLATTVATPV
LSGAVVLLYI DRRIRREGLD VTLTEAARQR AATP