Gene Ava_2778 details

Gene Information

Locus tag: Ava_2778
Symbol:
ID: 3681711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 3432126
End bp: 3433355
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 43%
IMG OID: 637718124
Product: Integrins alpha chain
Protein accession: YP_323286
Protein GI: 75908990
COG category:
COG ID:
TIGRFAM ID:
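
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive, and that the stop codon counts toward the gene length but not the protein length. Both assumptions are consistent with the figures listed, but the record does not state them. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming one trailing stop codon is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3432126, 3433355)
print(length, protein_length_aa(length))  # prints "1230 409"
```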


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGAAA CGCAAAGCTT TTCCAAGAGA ATAGATAATT CGCTTAATAC ATCTTTTGCA 
AGATCAGCAT CTTTGGTATC GAGTGCTTAT GTAAGTGATT TAAATACTGA GCTACCATCA
TCTAGTAATA GCTCTGCAAC AACACAAGTA GATTTGCCAA TTTCCATTCC TGGTTTGAGT
GCCAACCCAA ATCCTAATCC TTACTTAACT AGCGCAGCAA TTTCACCTGA TTTCAACGGC
GATGGTAAGG CAGATAAGGT TTGGGTTAAT ACTCAAACAG GTGAAATTAG AGTGAGGTTG
ATGAACGGAA CAGTCACTCA AGAAGAAGCC TCTTTAGGAA CATTTGATCT GAGTGTCTGG
ACTTATAAAA TCGCTGACTT TAACAGTGAT GGCAAGACTG ACTTCCTGTT ACGGAACAAT
GCAACAGGCG AGAATGCGAT TGCGATCATG GATGGAGCTA GAGTTGCTAA CTTTGTTTAT
CTAGACAAAG TTGATCCAGG TTGGAATGCT AGCATCGGTG ATTTTAACGG CGATCGCAAA
ACCGACATCC ACTGGAATAA TACTCAAACA GGTGAAAATG CAATTTGGCT GATGGATGGC
ACAACCGTTG TCAGTGCCAA TGTTCTGGAT ACCACAACCC CAGGGTTGAG TGCCACTATT
GTTGACTTCG ACGGAAACGG TAAGAGTGAT ATCTTCTGGC GAGATACAAC CACAGGTGCA
AACTCCGTTT GGTTTATGGA TGGTATCCAA GCTACCAAGT ATGATCTACA AGCACAAGAC
GCATCTTGGT CTTACAGCCT TGGCGATTTC AACGGTGACT TCACTACTGA TCTTCTCTGG
CGTAATACCG TCACTGGTGA AAACAAAATT TGGACAATGA ATGGCATTTT TGTCACTGAA
GGTACTTTAA ACACCCTCAG TTCCGATTGG ACAGCCAACA TTGGTGATTT CAATGGTGAT
GGCCGCACCG ATATCTTCTG GAACAACACT ACAACTGGTG CAAACACTGC TTGGTTAATG
AATGGTACAT CCGTTACCAG TGAAGCCTTT CTACCAAGTA GAAGTCCAGG TTCTAAGGCG
TATATTGGCG ATTATAACAG CGATGGCAAA TCTGATATTT ACTGGCGTGA TCAGGCAACC
GGTACAGATG CCATCTGGAC TATGGATGGT ACCTTGGCTA CTGAAACTCC TGTTACAGAT
GCTCTGACTC CAGAGTGGTA CACAGCTTAG
 
Protein sequence
MQETQSFSKR IDNSLNTSFA RSASLVSSAY VSDLNTELPS SSNSSATTQV DLPISIPGLS 
ANPNPNPYLT SAAISPDFNG DGKADKVWVN TQTGEIRVRL MNGTVTQEEA SLGTFDLSVW
TYKIADFNSD GKTDFLLRNN ATGENAIAIM DGARVANFVY LDKVDPGWNA SIGDFNGDRK
TDIHWNNTQT GENAIWLMDG TTVVSANVLD TTTPGLSATI VDFDGNGKSD IFWRDTTTGA
NSVWFMDGIQ ATKYDLQAQD ASWSYSLGDF NGDFTTDLLW RNTVTGENKI WTMNGIFVTE
GTLNTLSSDW TANIGDFNGD GRTDIFWNNT TTGANTAWLM NGTSVTSEAF LPSRSPGSKA
YIGDYNSDGK SDIYWRDQAT GTDAIWTMDG TLATETPVTD ALTPEWYTA
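
The gene and protein sequences above can be cross-checked against each other and against the reported GC content. The following is a minimal sketch in plain Python, not the database's own method. It assumes the sequence is pasted with its space-separated blocks of ten, and it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ only in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence shown above (60 bases).
first_line = clean("ATGCAAGAAA CGCAAAGCTT TTCCAAGAGA ATAGATAATT CGCTTAATAC ATCTTTTGCA")
print(translate(first_line))  # prints "MQETQSFSKRIDNSLNTSFA"
```

Passing the full gene sequence through `clean` and then `gc_content` or `translate` allows a comparison with the GC content and protein sequence given in this record.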