Gene Franean1_3851 details


Gene Information

Locus tag: Franean1_3851
Symbol:
ID: 5672214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4573273
End bp: 4574370
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 70%
IMG OID: 641242729
Product: hemerythrin HHE cation binding domain-containing protein
Protein accession: YP_001508149
Protein GI: 158315641
COG category:
COG ID:
TIGRFAM ID:
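
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the record (Start bp, End bp).
start_bp = 4573273
end_bp = 4574370


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1098 365
```

Under these assumptions the computed values agree with the Gene Length (1098 bp) and Protein Length (365 aa) fields.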


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00703779
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACC TTCATGATCT CCCGGCCCGG GTCTTCAGCC TCATAGAGTC GGGGGTGGTG 
GCCGAGTTCT CGACGGTGTC CGCGGCCGGC ATTCCGATCG ACACGCCCAC CTACTATTTT
CCGGCCGACG ACATGTCGAC GATCGACCTG GCAACGGGCC TGCCCAATCC GGCCAAGGCC
GAGCGGGTGC GGCGCACCTC GAAGGTCGGG CTCCTGATCG AGGGCCGCCC CGAGGAACCG
GTCGTGGTCA TTCGAGCCCA CGGTGCGGTG CGCGACAGCG ACATCCAGGC CAACGCCATC
CGCTATCTCG CCGAGACCGG CTACGAGGGG ATCAGCCACG GCATCACGTG GGAGGAGGCA
CGCAAGGCCG TCACCTACTG GTCTCGGATC ATCATCGAGA ACAAGCCCGA ACGGGTGTAC
TGGTGGGACA GCCACGCGGC CCTCGACGAC CCACCGCAGG TGTGGTCCGC GGCGCCCGAC
ACGGTCTACC CGACATCCGA CCCCTCGCCC GCCCGCAGGA TGAAGCGGTC GGCCTGGCCG
GTCCGTCCCT GGCAGGACGT GGCCAGGGAG GCCGTGGAGG GCGGCACTCC GGCGCATCTG
TCGGTGCTCG ACGAGGACGG CTTTCCGCTT CCGATGCGGG CGACCTCCTT CGAGCTGACA
GCCGAGGGCT TCCGGCTGGC GGTGCCCACG GGCACCCCCT GGCGGCTGCG GGGCAGGGCG
TCGCTCACCT TCGCGGGTTT CCGTACCTTC GTCGGCGACG CCGGAGCGGA CGGCGGTGCT
GAAGCGAACG GCGGTGCTGA AGCGAACGGC GGTGCTGGAG TGGACGGCGG CGACCGCGTC
CTGTTCCGCG TCGAGCGCTC GCTCCCCCAG CACCCGGCGA CACTCGACAC GAAGCAGGTT
CTCCAGCCGA GCGAGGACAC CCTCGCCAAG GCGAGGGCAC GGCTCGAGTA CGAGGCTGAG
CGGCGCGGCC AGTCCCTCCC GGTCATCCCG GCGGACCCCC CGCCACGGAC CCGCATCGCG
CTGATCCGCC GGGCGCGCAT CGCCAGCGAC GCACCGATCA CCGGCATCAC CGAGGAGCAC
GGGAACCGAA GGACCTGA
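
A GC content figure like the one in the Gene Information section can be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over the total sequence length, and that the spaces and line breaks in the listing are formatting only; `gc_content` is a hypothetical helper name. It is shown on the first 10-base block of the sequence rather than the full gene.

```python
def gc_content(seq):
    """Percentage of G and C bases, ignoring whitespace in the input."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First 10-base block of the gene sequence above.
first_block = "ATGAGTGACC"
print(gc_content(first_block))  # 50.0
```

Passing the complete gene sequence, with its spaces and line breaks left in, gives the value for the whole gene.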
 
Protein sequence
MSDLHDLPAR VFSLIESGVV AEFSTVSAAG IPIDTPTYYF PADDMSTIDL ATGLPNPAKA 
ERVRRTSKVG LLIEGRPEEP VVVIRAHGAV RDSDIQANAI RYLAETGYEG ISHGITWEEA
RKAVTYWSRI IIENKPERVY WWDSHAALDD PPQVWSAAPD TVYPTSDPSP ARRMKRSAWP
VRPWQDVARE AVEGGTPAHL SVLDEDGFPL PMRATSFELT AEGFRLAVPT GTPWRLRGRA
SLTFAGFRTF VGDAGADGGA EANGGAEANG GAGVDGGDRV LFRVERSLPQ HPATLDTKQV
LQPSEDTLAK ARARLEYEAE RRGQSLPVIP ADPPPRTRIA LIRRARIASD APITGITEEH
GNRRT