Gene Franean1_3923 details

Gene Information

Locus tag: Franean1_3923
Symbol:
ID: 5672284
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4692495
End bp: 4693607
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 67%
IMG OID: 641242802
Product: hypothetical protein
Protein accession: YP_001508219
Protein GI: 158315711
COG category:
COG ID:
TIGRFAM ID:
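
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 4692495
end_bp = 4693607
gene_length_bp = 1113
protein_length_aa = 370


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```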


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00983972
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0399777
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCTGG GCCTGGCGTT CACCGCCTGT TCCAGTTCCA GCGACGACAC CGACGGCGGG 
GAGACGACGG CCGCGCCCGA CCAGACGGTG AACGGGGCGC CGGGGGTCAC CCAGGACGAG
ATCCGCTTCT TGGTTCTCGG CACGAAGACC AACAACCCGA CCGGCTCCTG CACGCTCGAC
TGCTTCTCCC AGGGGATCAA GGCCTACTTC GCCTTCCGTA ACAGCACGGG CGGGGTGCGC
GGCCGCAAGC TGACCGTCAC CACCGAGATC GACGACGCGC TCGGGCAGAA CCAGCAGGGC
GCGCTGCAGA TCATCACCAA GAACGACACG TTCGCCGACT TCGGCGCCGC GCAGCTGCCC
ACCGGCTGGG GTGACCTGGT CAAGGCCGGC GTCCCGCAGT ACGTGTGGGC CATCCAGCCG
CAGGCCATGG CCGGCCAGGA CTCGATCTTC GGCAACGCCG GAGTGACCTG CCTGGAGTGC
ACGAACCGGA CCTTCACGTA CGCGGCGGAG CTCGCCGGCG CGAAGAAGAT CGGCGCCCTC
GGCTACGGGG TCTCGGAGAG CTCGAAGCGC TGCACGTCGA CCATCACCGA CACGATCGAG
CTCTACCACG ACAAGACCAG CCAGGAGGTC GTGTACAAGA ACGACGACCT GGCCTTCGGC
CTGCCGAACG GGATCGACCC CGAGGTCTCC GCGATGAAGC GCGCCGGCAC CGACATGATC
ATTACCCGCC TCGACCTCAA CGGCATGAAG ACGCTGGCAC AGGAGCTCGA GCGCCAGGGC
ATGGGCGACA TCCCGCTTTA CCATCCGAAC ACCTATGACC GGAAGTTCGT CGCCGCGACG
GGCGACCTGT TCGAGGGGGA CTACATCGGC GTCACCTTCC GCCCGTTCGA CGCCGACCTC
GCCTACCAGG GAATTCTCGC CGCCGGACCT TCCTTTGACC ACGCCAAGGT CATCGCCGCG
AAGAACGCCA TGACCGAGTA CAGCGCGGAC GGCCTGATCA ATCCCAGCGA CTGGAGCCGC
CAGCACGAAA GCCCGACGCA GGACGACCCG GCGACCCACG GCTACAGGCA GGAGCGCTTC
GCCATGGTCC AGGTGCGGAA AGGAAAGTTC TAG
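
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to strip the layout whitespace and compute the GC fraction; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C, ignoring layout whitespace."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGGCCCTGG GCCTGGCGTT CACCGCCTGT TCCAGTTCCA GCGACGACAC CGACGGCGGG"
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line), 2))
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the Gene Information table.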
 
Protein sequence
MALGLAFTAC SSSSDDTDGG ETTAAPDQTV NGAPGVTQDE IRFLVLGTKT NNPTGSCTLD 
CFSQGIKAYF AFRNSTGGVR GRKLTVTTEI DDALGQNQQG ALQIITKNDT FADFGAAQLP
TGWGDLVKAG VPQYVWAIQP QAMAGQDSIF GNAGVTCLEC TNRTFTYAAE LAGAKKIGAL
GYGVSESSKR CTSTITDTIE LYHDKTSQEV VYKNDDLAFG LPNGIDPEVS AMKRAGTDMI
ITRLDLNGMK TLAQELERQG MGDIPLYHPN TYDRKFVAAT GDLFEGDYIG VTFRPFDADL
AYQGILAAGP SFDHAKVIAA KNAMTEYSAD GLINPSDWSR QHESPTQDDP ATHGYRQERF
AMVQVRKGKF
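
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The record lists translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code, so the sketch below builds the standard table by hand. The `translate` helper is illustrative, reads in frame from the first base, and stops at the first stop codon; it does not handle alternative start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 18 bases and last 15 bases of the gene sequence shown above.
print(translate("ATGGCCCTGG GCCTGGCG"))
print(translate("AAAGGAAAGT TCTAG"))
```

The two fragments correspond to the beginning and end of the listed protein sequence, which is a quick way to confirm that the gene and protein records agree.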