Gene Franean1_2947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2947 
Symbol 
ID5671333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3468102 
End bp3469235 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content68% 
IMG OID641241853 
Producthypothetical protein 
Protein accessionYP_001507273 
Protein GI158314765 
COG category[S] Function unknown 
COG ID[COG3662] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.985948 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC ACACCTCAAC CATCGAACCC ACCGCTGCGC CGCGGACGCT GACCGTCCCG 
ACAGCCGCCG CTCCGGTCCG TTCGGAAGAA GTCGACTGGG CGCTCGGTCC CGGCTCGGTT
ACCTGGGAAG TCATGAAGGA CCCTGCCGTG TTCCTGGTCG GGCTGCTTCG AGAGGCCATT
CTCCTGACGC TTCACCCACC GTTCGCCGCC GCCGCGATAG ATCATGACAG CTTTCTGGAT
GACCCGGTGA TGAGGTTCCG GCGAGTGGCC ATGTACGCCT ACTCGGCCAC GTACGGCACC
AAGGCTGACG CCGAGAGGGT CAGCGCGATG GTGCGTCGGC GGCACTTCCA GATCGTGGGC
GTCGAGCCTC TGAGCGGCGA GCCGTACCGG GCAGACTCGG AGTACGAGCT GGCATTGACC
CAGGCCATGC TGGCGGCGTC GTTCCTGGCG GTGTACGAGG AAGTCCACGG CCGCCTCTCC
ACCGCCCGAC GTGACCAGTT CCTCATGGAG CAGAAGGTGC CCGCGGCGCT GCTGGGCGTA
CCGCCCGAGC ACATGCCCTC GACGTGGGGT GATCTTCAGC GGTTCCTCGC CAGGGCGCGG
GACGGCTTCG CCACCGGCTA CCAGGCCCGG GAGATCATCG ATCCCTTCTC CCGAGGCAGC
TACCCGCCTG GTAGCGTGCT CGGGGACCTG CCGACACTGA AGCGACAGGC CGCTATGTGG
CTGATCCGGG CGATCGCCGA CATGGCCATT CTCACCATGA ACGATGAGGA ACGCGCCCTC
CTCGCGATCG ACCGACGGCC CAAGCTGCGG TCGCAGGCAG CAATCCGGCT CTCGCTCAGG
GCACTGTCCC GGTACCTGCG TAGCGAGAAG GGAACGCTCG CCTTCGAGGG GTTCGTCAAG
GCGAACACCG CGAAGATCAT GCGGCGAGCC TTCGAGGTCG ACAGGAAGCC GGGCCGCCGT
GCCCGGGAAA AGGCATTCCG GGTCCCGGAC GCCGCCGGCT TCGTCGTCCA GCTGCCCGAC
CTCGTGCACA ACTGGCCTGG CTCCCGGAGC ATCGCCGAGC AGCCGAAGCC AGTGGAGGGC
CCGTCGGCTC ACGAGACTCG AGCCGGGGCC GGGAGAGCCC GCCGCGCCGG ATGA
 
Protein sequence
MTDHTSTIEP TAAPRTLTVP TAAAPVRSEE VDWALGPGSV TWEVMKDPAV FLVGLLREAI 
LLTLHPPFAA AAIDHDSFLD DPVMRFRRVA MYAYSATYGT KADAERVSAM VRRRHFQIVG
VEPLSGEPYR ADSEYELALT QAMLAASFLA VYEEVHGRLS TARRDQFLME QKVPAALLGV
PPEHMPSTWG DLQRFLARAR DGFATGYQAR EIIDPFSRGS YPPGSVLGDL PTLKRQAAMW
LIRAIADMAI LTMNDEERAL LAIDRRPKLR SQAAIRLSLR ALSRYLRSEK GTLAFEGFVK
ANTAKIMRRA FEVDRKPGRR AREKAFRVPD AAGFVVQLPD LVHNWPGSRS IAEQPKPVEG
PSAHETRAGA GRARRAG