Gene Franean1_2244 details


Gene Information

Locus tag: Franean1_2244
Symbol:
ID: 5670643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2681725
End bp: 2683155
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 73%
IMG OID: 641241164
Product: ErfK/YbiS/YcfS/YnhG family protein
Protein accession: YP_001506585
Protein GI: 158314077
COG category: [S] Function unknown
COG ID: [COG1376] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
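The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database tool.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information record.
start_bp, end_bp = 2681725, 2683155
gene_length_bp, protein_length_aa = 1431, 476

print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for the values listed here: the span covers 1431 bp, which is 477 codons, or 476 residues plus a stop codon.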


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.111887
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00847254
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGTCCCGCT CGATGATGAG CTTGATACGC GCGCCCCGAG GCGCCGTTCC GTCCGGTTCC 
GGTTCCGGTT CCGATCAGGA TCGGGCCTCC GACCGGAGGG GGTCGCCCGC CACCCGCCGT
CCGCTGGTCC GGCGCCGGTC GGAGGCCGTG CGGCCCGCCG GGATCATCCG GTCGCGGCTC
GCCGCGGCCT CCGGCGTGCT CGCCGCGGCC GTGGTCCTGG CTTCCTGCTC CTCCGGGGGC
GGGGGCACCG GCCCGGGCCA GAACGAGCCA CCGCGCCCGG CCGCGCCGAC CGTGACGGTG
ACCCCGGCCG ACGGCGCTGC CGGCATCGCG CTCACCGAGT CGATCGTGGT GAAGAGCAAC
GCGCCGCTGG CGTCGGTGAC CGTGGCCCGC GGCGCGAGCC CGACGGAGAA GACCGACCCC
GGCACGCTGG AGGGGACCTT CTCCGCCGAC CGCCGGACCT GGACGTCCGC GGGTGGGTTG
TTCTCCGACA CCCGCTACGA CATCCAGGCG GCCACCGCGC CGGCCCAGGG GCTGGACGGC
ACCAGGAACA TCGCATCGAG CTTCACCACC GGCGTCCCGG ACAAGGCGTT CAAGGTGTCG
TGGGAGCCGG TCGCCGGTCA GACCGTCGGG GTCGGCGCCC CGATCAGCCT GACCTTCAGC
GCTCCGGTCA AGGACCGCGC GGCGGTGCAG AGCCGCCTGG CGGTGAACGC CGACCCGCCG
GTCCTCGGCG CCTGGAACTG GATGTCGGAC CGGATGGCCG TGTGGCGCCC GCAGCAGTAC
TGGGCGCCCG GGACGAAGGT GCACGTGGAG GCCAACCTCG CGGGCTTCGA CTCCGGCACC
GGCTGGATCG GGGTCAAGGA CCGCTCGATG GACTTCGCGA TCGGGGCCGC CCAGATCAGC
AAGGTCGACG CGGCCACCCA CGTGATGCAG GTGTTCCAGA ACGGCCAGCT CGTGCGGACC
ATGCCGATCA GCGGTGGCAA GCCCGGGTTC CTGACCATGG AGGGCCCGCA CAACGTGCTG
GGCAAGGCCC CGATGGTGAT CATGGACTCG GCGACGGTCG GCGTGCCGAA GGGCAACCCG
GAGTACTACT ACGAAGAGGT GCAGTGGGCC GTCCACTACA CCAGCGGTGG GCAGTACGTG
CACTCCGCTC CGTGGTCAGT GGCGTCGCAG GGCCGGGCGA ACGTCTCGCA CGGGTGCGTG
AACGCCTCCC CGGCGGACGC GCAGTGGTTC TACAACTTCA GTCAGTTCGG CGACATCGTC
GACATCAGCA ACACCGGTCG CCCGGCGGAT ACCCGGCAGC TCGGCAACGA GTGGTCCGTC
CCGTGGGACA CCTGGAAGGC GGGCAGCGCG CTGCCCGTTG ACCAGCCCGC GGCCAGCGGT
GCGCTGGCGG GCGCTGCGCC CGGCGCGGGG CTGCCCGCCG GTCGGACCTG A
 
Protein sequence
MSRSMMSLIR APRGAVPSGS GSGSDQDRAS DRRGSPATRR PLVRRRSEAV RPAGIIRSRL 
AAASGVLAAA VVLASCSSGG GGTGPGQNEP PRPAAPTVTV TPADGAAGIA LTESIVVKSN
APLASVTVAR GASPTEKTDP GTLEGTFSAD RRTWTSAGGL FSDTRYDIQA ATAPAQGLDG
TRNIASSFTT GVPDKAFKVS WEPVAGQTVG VGAPISLTFS APVKDRAAVQ SRLAVNADPP
VLGAWNWMSD RMAVWRPQQY WAPGTKVHVE ANLAGFDSGT GWIGVKDRSM DFAIGAAQIS
KVDAATHVMQ VFQNGQLVRT MPISGGKPGF LTMEGPHNVL GKAPMVIMDS ATVGVPKGNP
EYYYEEVQWA VHYTSGGQYV HSAPWSVASQ GRANVSHGCV NASPADAQWF YNFSQFGDIV
DISNTGRPAD TRQLGNEWSV PWDTWKAGSA LPVDQPAASG ALAGAAPGAG LPAGRT
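The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before any calculation. The following is a minimal sketch, with hypothetical helper names (`clean`, `gc_percent`, `ends_with_stop`), of how the gene sequence could be checked against the reported GC content and for a terminal stop codon. The stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return clean(seq)[-3:] in STOP_CODONS


# Last line of the gene sequence block, copied from the record.
last_line = "GCGCTGGCGG GCGCTGCGCC CGGCGCGGGG CTGCCCGCCG GTCGGACCTG A"
print(ends_with_stop(last_line))
```

Passing the full gene sequence block to `gc_percent` gives a value that can be compared with the GC content listed in the Gene Information section.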